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TITLE OF THE INVENTION 

CARCINOEMBRYONIC ANTIGEN FUSIONS AND USES THEREOF 

FIELD OF THE INVENTION 
5 The present invention relates generally to the therapy of cancer. More specifically, the 

present invention relates to polynucleotides encoding fusion proteins wherein the fusion proteins 
comprise at least a portion of the tumor associated polypeptide carcinoembryonic antigen. The present 
invention also provides recombinant vectors and hosts comprising said polynucleotides, purified fusion 
proteins and methods for enhancing an immune response against CEA using the compositions and 
10 molecules disclosed herein. 

BACKGROUND OF THE INVENTION 

The immunoglobulin superfamily (IgSF) consists of numerous genes that code for 
proteins with diverse functions, one of which is intercellular adhesion. IgSF proteins contain at least one 
15 Ig-related domain that is important for maintaining proper intermolecular binding interactions. Because 
such interactions are necessary to the diverse biological functions of the IgSF members, disruption or 
aberrant expression of many IgSF adhesion molecules has been correlated with many human diseases. 

The carcinoembryonic antigen (CEA) belongs to a subfamily of the Ig superfamily 
consisting of cell surface glycoproteins. Members of the CEA subfamily are known as CEA-related cell 
20 adhesion molecules (CEACAMs). In recent scientific literature, the CEA gene has been renamed 
CEACAM5, although the nomenclature for the protein remains CEA. Functionally, CEACAMs have 
been shown to act as both homotypic and heterotypic intercellular adhesion molecules (Benchimol et aL, 
Cell 57: 327-334 (1989)). In addition to cell adhesion, CEA inhibits cell death resulting from 
detachment of cells from the extracellular matrix and can contribute to cellular transformation associated 
25 with certain proto-oncogenes such as Bcl2 and C-Myc (see Berinstein, J. Clin OncoL 20(8): 2197-2207 
(2002)). Sequences coding for human CEA have been cloned and characterized (U.S. Patent No. 
5,274,087; U.S. Patent No 5,571,710; and U.S. Patent No 5,843,761. See also Beauchemin et aL, Mol. 
Cell Biol 7:3221-3230 (1987); Zimmerman et al., Proc. Natl Acad. Sci. USA 84:920-924 (1987); 
Thompson et al. Proc. Natl Acad. ScL USA 84(9):2965-69 (1987)). 
I 30 Normal expression of CEA has been detected dining fetal development and in adult 

colonic mucosa. CEA overexpression was first detected in human colon tumors over thirty years ago 
(Gold and Freedman, J. Exp. Med. 121:439-462 (1965)) and has since been found in nearly all colorectal 
tumors. Additionally, CEA overexpression is detectable in a high percentage of adenocarcinomas of the 
pancreas, liver, breast, ovary, cervix, and lung. Because of its prevalence in these tumor types and 
35 limited normal tissue expression, CEA is considered a self tumor-associated antigen and a target for 
active and passive immunotherapy. Recent clinical data have established that different vaccine strategies 
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can generate human B and T cells specific for CEA, providing additional evidence that CEA is a target 
for molecular and immunological intervention for treatment of these cancer types. 

Therapeutic approaches targeting CEA include the use of anti-CEA antibodies (see 
Chester et al., Cancer Chemother. Pharmacol 46 (Suppl): S8-S12 (2000)), as well as CEA-based 
5 vaccines (for review, see Berinstein, supra). The development and commercialization of many vaccines 
have been hindered by difficulties associated with obtaining high expression levels of exogenous genes. 
Success of DNA-based vaccines has also been hindered by an inability to generate an immune response 
of sufficient magnitude in treated individuals. DNA vaccines targeting various proteins have been 
developed, but the immune responses induced to date by DNA vaccines have been relatively weak 
10 compared with conventional vaccines. 

The ease of DNA manipulation has offered an opportunity to develop gene fusion 
vaccine strategies in which antigens are linked to various immunoenhancing elements. Enhancement of 
immune response to target antigens has been demonstrated in animal models by vectors encoding 
antigens fused to heat shock protein (HSP) 70 (Liu et al., J. Virol. 74: 2888-94 (2000); Cheng et al. J. 
15 Immunol. 166: 6218-26 (2001); Chen et al., Cancer Res. 60: 1035-42 (2000)), to Fc portion of IgGl 
(You et al., 7. Immunol. 165: 4581-92 (2000)), to lysosome-associated membrane protein (LAMP) (Su et 
al., Cancer Res. 62: 5041-48 (2002)), and universal Th epitope from tetanus toxin (Renard et al., J. 
Immunol. 171:1588-95 (2003); King et al., Nature Med. 4: 1281-86 (1998); Lund et al., Cancer Gene 
Then 10: 365-76 (2003); Padua et al., Nature Med. 9(11): 1413-17 (2003); Savelyeva et al., Nature 
20 BiotechnoL 19: 760-64 (2001); Wahren et al., WO 2004/092216). The enhancement of immune 
responses to target antigens is particularly relevant for cancer vaccines in view of the limited 
immunogenicity of tumor antigens and of the need to overcome tolerance to exert effective antitumor 
effects. 

Therefore, despite the identification of the wild-type nucleotide sequences encoding 
25 CEA proteins described above, it would be highly desirable to develop a vaccine which is capable of 
eliciting an enhanced CEA-specific immune response relative to a wild-type full-length CEA cDNA, 
when delivered to a mammal. It would also be desirable to develop methods for treating or preventing 
CEA-associated cancers which utilize nucleic acid molecules or proteins that safely and effectively 
potentiate a CEA-specific immune response. 

30 

SUMMARY OF THE INVENTION 

The present invention provides polynucleotides encoding fusion proteins wherein the 
fusion proteins comprise at least a portion of the tumor associated polypeptide carcinoembryonic antigen, 
fused to a substantial portion of an immunoenhancing element. In preferred embodiments, the CEA 
35 portion of the encoded CEA fusion protein is deleted of its C-terminal anchoring domain. In other 
preferred embodiments, the immunoenhancing element is the minimized domain of tetanus toxin 
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fragment C (DOM), or substantial portion thereof. The present invention also provides recombinant 
vectors, including but not limited to, adenovirus and plasmid vectors, comprising said polynucleotides 
and host cells comprising said recombinant vectors. Also provided herein are purified fusion proteins 
encoded by invention polynucleotides. 
5 The present invention further provides methods for inhibiting the development of a 

cancer in a mammal, or treating or minimizing an existing cancer, by eliciting an immune response to 
CEA, such methods comprising administering a vaccine or pharmaceutical composition comprising a 
CEA fusion protein or a nucleotide sequence encoding a CEA fusion protein, as described herein. In 
preferred embodiments of the methods herein, the immune response is enhanced relative to the response 

10 elicited by a wild-type CEA. 

As used throughout the specification and in the appended claims, the singular forms "a," 
"an," and "the" include the plural reference unless the context clearly dictates otherwise. 

As used throughout the specification and appended claims, the following definitions and 
abbreviations apply: 

15 The term "promoter" refers to a recognition site on a DNA strand to which the RNA 

polymerase binds. The promoter forms an initiation complex with RNA polymerase to initiate and drive 

transcriptional activity. The complex can be modified by activating sequences termed "enhancers" or 

inhibiting sequences termed "silencers". 

The term "cassette" refers to a nucleotide or gene sequence that is to be 
20 expressed from a vector, for example, the nucleotide or gene sequence encoding an hCEA-DOM fusion 

protein. In general, a cassette comprises a gene sequence that can be inserted into a vector, which in 

some embodiments, provides regulatory sequences for expressing the nucleotide or gene sequence. In 

other embodiments, the nucleotide or gene sequence provides the regulatory sequences for its expression. 

In further embodiments, the vector provides some regulatory sequences and the nucleotide or gene 
25 sequence provides other regulatory sequences. For example, the vector can provide a promoter for 

transcribing the nucleotide or gene sequence and the nucleotide or gene sequence provides a transcription 

termination sequence. The regulatory sequences that can be provided by the vector include, but are not 

limited to, enhancers, transcription termination sequences, splice acceptor and donor sequences, introns, 

ribosome binding sequences, and poly(A) addition sequences. 
30 The term "vector" refers to some means by which DNA fragments can be introduced into 

a host organism or host tissue. There are various types of vectors including plasmid, virus (including 

adenovirus), bacteriophages and cosmids. 

The term "first generation," as used in reference to adenoviral vectors, describes 

adenoviral vectors that are replication-defective. First generation adenovirus vectors typically have a 
35 deleted or inactivated El gene region, and preferably have a deleted or inactivated E3 gene region. 
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The designation "pVU/hCEAopt" refers to a plasmid construct, disclosed herein, 
comprising the CMV immediate-early (IE) promoter with intron A, a full-length codon-optimized human 
CEA gene, bovine growth hormone-derived polyadenylation and transcriptional termination sequences, 
and a minimal pUC backbone (see EXAMPLE 1). The designation "pVU/hCEA" refers to a construct 
5 essentially as described above, except the construct comprises a wild-type human CEA gene instead of a 
codon-optimized human CEA gene. 

The designation "pVU/hCEA-DOM" refers to a plasmid construct, disclosed herein, 
comprising the CMV immediate-early (IE) promoter with intron A, a human CEA gene devoid of its GPI 
anchor coding sequence, fused at its C-terminal end to the N-terminal domain of Fragment C of tetanus 
10 toxoid (DOM), bovine growth hormone-derived polyadenylation and transcriptional termination 
sequences, and a minimal pUC backbone (EXAMPLE 1). 

The designation "pVlJ/hCEAopt-LTBopt" refers to a construct essentially as described 
immediately above, except the construct comprises a codon-optimized human CEA gene devoid of its 
GPI anchor coding sequence instead of the corresponding portion of the wild-type human CEA gene, 
15 fused at its C-terminal end to a nucleotide sequence encoding DOM that has also been codon-optimized 
for high-level expression in human cells.. 

The designation "pVU/rhCEAopt-DOMopt" refers to a construct essentially as 
described above except that the human codon-optimized CEA gene is replaced with a rhesus monkey 
CEA gene, codon-optimized for high-level expression in human cells. 
20 The designation "pVU/hCEA-FcIgG" refers to a plasmid construct,, disclosed herein, 

comprising the CMV immediate-early (IE) promoter with intron A, a human CEA gene devoid of the GPI 
anchor coding sequence, fused at its C-terminal end to the heavy fragment of constant chain of 
immunoglobulin Gl, bovine growth hormone-derived polyadenylation and transcriptional termination 
sequences, and a minimal pUC backbone. (EXAMPLE 1). pVU/hCEAopt-FcIgGopt refers to a 
25 construct essentially as described, except the nucleotide sequences encoding CEA and FcIgG have been 
codon-optimized for high-level expression in human cells. 

The designations "Ad5/hCEAopt" and "Ad5/hCEA" refer to two constructs, disclosed 
herein, which comprise an Ad5 adenoviral genome deleted of the El and E3 regions. In the 
"Ad5/hCEAopt" construct, the El region is replaced by a codon-optimized human CEA gene in an El 
30 parallel orientation under the control of a human CMV promoter without intron A, followed by a bovine 
growth hormone polyadenylation signal. The "AdS/hCEA" construct is essentially as described above, 
except the El region of the Ad5 genome is replaced with a wild-type human CEA sequence. The 
designation "Ad5/hCEAopt-DOMopt" refers to an Ad5 construct, essentially as described above, except 
that the codon-optimized human CEA sequence is devoid of the GPI anchor coding sequence and is fused 
35 at its C-terminus to a codon-optimized nucleotide sequence encoding DOM. Construction of adenovirus 

vectors comprising various CEA fusions is described in EXAMPLE 2. 

4 
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The abbreviation "DOM" refers generally to the N-terminal domain of fragment C of 

tetanus toxoid. 

As used herein, a "fusion protein" refers to a protein having at least two polypeptides 
covalently linked in which one polypeptide comes from one protein sequence, or domain and the other 

5 polypeptide comes from a second protein sequence or domain. The fusion proteins of the present 
invention comprise a CEA polypeptide or fragment or variant thereof, and a second polypeptide, which 
comprises a substantial portion of an immunoenhancing element, which, in some cases, is a bacterial 
toxin. The CEA polypeptide, fragment or variant thereof may be a human CEA or CEA homolog from 
another species. The polypeptides that comprise the fusion protein are preferably linked N-terminus to 
10 C-terminus. The CEA polypeptide and the toxin subunit can be fused in any order. In some 
embodiments of this invention, the C-terminus of the CEA polypeptide is fused to the N-terminus of the 
toxin subunit, as exemplified in FIGURE 1 A. However, fusion proteins in which the immunoenhancing 
element is fused to the N-terminus of the CEA polypeptide are also contemplated. The term "CEA 
fusion protein" is intended to be a general term which refers to a fusion as described above, which 

15 comprises a CEA polypeptide or fragment or variant thereof fused to a polypeptide comprising an 

immunoenhancing element. 

The term "CEA-DOM fusion" refers to a nucleic acid sequence in which at least a 
portion of the CEA gene is fused to a substantial portion of the minimized domain of tetanus toxin 
fragment C, unless the context clearly dictates that said term refers to the protein sequence. The term 
20 "CEA-DOM fusion protein" refers to a polypeptide encoded by a CEA-DOM fusion as described. The 
terms "CEA-DOM fusion" and "CEA-DOM fusion protein" are also understood to refer to fragments 
thereof, homologs thereof, and functional equivalents thereof (collectively referred to as "variants"), such 
as those in which one or more amino acids is inserted, deleted or replaced by other amino acid(s). The 
CEA-DOM fusions of the present invention, upon administration to a mammal such as a human being, 
25 can stimulate an immune response by helper T cells or cytotoxic T cells, or stimulate the production of 
antibodies at least as well as a "wild-type" CEA sequence. In preferred embodiments of the invention, 
the CEA-DOM fusion can enhance the immune response as compared to a wild-type CEA. 

The abbreviation "AD" refers to the anchoring domain of a CEA gene or protein. The 
anchoring domain of the wild-type human CEA is located from about amino acid 679 to about amino 
30 acid 702 of SEQ ID NO.30. An amino acid sequence encoding a human CEA devoid of its anchoring 
domain, herein designated CEAAAD, is set forth in SEQ ID NO:3. 

The term "treatment" refers to both therapeutic treatment and prophylactic or 
preventative measures. Those in need of treatment include those already with the disorder as well as 
those prone to have the disorder or those in which the disorder is to be prevented. 
35 A "disorder" is any condition that would benefit from treatment with the molecules of 

the present invention, including the nucleic acid molecules described herein and the fusion proteins that 

5 
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are encoded by said nucleic acid molecules. Encompassed by the term "disorder" are chronic and acute 
disorders or diseases including those pathological conditions which predispose the mammal to the 
disorder in question. The molecules of the present invention are intended for use as treatments for 
disorders or conditions characterized by aberrant cell proliferation, including, but not limited to, 
5 pancreatic cancer, liver cancer, breast cancer, colorectal cancer, ovarian cancer, cervical cancer, and lung 
cancer. 

The term "effective amount" means sufficient vaccine composition is introduced to 
produce the adequate levels of the polypeptide, so that an immune response results. One skilled in the art 

recognizes that this level may vary. 
10 A "conservative amino acid substitution" refers to the replacement of one amino acid 

residue by another, chemically similar, amino acid residue. Examples of such conservative substitutions 
are: substitution of one hydrophobic residue (isoleucine, leucine, valine, or methionine) for another; 
substitution of one polar residue for another polar residue of the same charge (e.g., arginine for lysine; 

glutamic acid for aspartic acid). 
15 "hCEA" and "hCEAopt" refer to a human carcinoembryonic antigen and a human codon- 

optimized carcinoembryonic antigen, respectively. 

"Substantially similar" means that a given nucleic acid or amino acid sequence shares at 
least 75%, preferably 85%, more preferably 90%, and even more preferably 95% identity with a 
reference sequence. In the present invention, the reference sequence can be relevant portions of the wild- 
20 type human CEA nucleotide or amino acid sequence, or the wild-type nucleotide or amino acid sequence 
of an immunoenhancing element or subunit thereof, such as DOM, as dictated by the context of the text. 
The reference sequence may also be, for example, the wild-type rhesus monkey CEA sequence. Thus, a 
CEA protein sequence that is "substantially similar" to the wild-type human CEA protein or fragment 
thereof will share at least 75% identity with the relevant fragment of the wild-type human CEA, along the 
25 length of the fragment, preferably 85% identity, more preferably 90% identity and even more preferably 
95% identity. Whether a given CEA, DOM, or other protein or nucleotide sequence is "substantially 
similar" to a reference sequence can be determined for example, by comparing sequence information 
using sequence analysis software such as the GAP computer program, version 6.0, available from the 
University of Wisconsin Genetics Computer Group (UWGCG). The GAP program utilizes the alignment 
30 method of Needleman and Wunsch (J. Mol. Biol. 48:443, 1970), as revised by Smith and Waterman 
{Adv. Appl. Math. 2:482, 1981). 

A "substantial portion" of a gene, variant, fragment, or subunit thereof, means a portion 
of at least 50%, preferably 75%, more preferably 90%, and even more preferably 95% of a reference 
sequence. 

35 "Immunoenhancing element" refers to a portion of the CEA fusion proteins of the 

present invention which is capable of stimulating or enhancing the immune response to the associated 
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CEA protein, relative to full-length wild-type CEA. Lumunoenhancing elements of the present invention 
are selected from the group consisting of: heat shock protein (HSP) 70, lysosome-associated membrane 
protein (LAMP), fragment C of tetanus toxoid (FrC), the N-terminal domain of FrC (DOM), the heavy 
fragment of constant chain of immune globulin Gl (FcIgG), the vesicular stomatitis virus glycoprotein 
5 (VSV-G), cholera toxin (CT) from Vibrio cholerae, and heat labile enterotoxin of Kcoli (LT). The term 
"immunoenhancing element" is used interchangeably herein with the term "adjuvant." 

A "gene" refers to a nucleic acid molecule whose nucleotide sequence codes for a 
polypeptide molecule. Genes may be uninterrupted sequences of nucleotides or they may include such 
intervening segments as introns, promoter regions, splicing sites and repetitive sequences. A gene can be 
10 either RNA or DNA. A preferred gene is one that encodes the invention peptide. 

The term "nucleic acid" or "nucleic acid molecule" is intended for ribonucleic acid 
(RNA) or deoxyribonucleic acid (DNA), probes, oligonucleotides, fragment or portions thereof, and 
primers. DNA can be either complementary DNA (cDNA) or genomic DNA, e.g. a gene encoding a CEA 
fusion protein. 

15 "Wild-type CEA" or "wild-type protein" or "wt protein" refers to a protein comprising a 

naturally occurring sequence of amino acids or variant thereof. The amino acid sequence of wild-type 
human CEA is shown in FIGURE 7E (SEQ ID NO:30). The amino acid sequence of the wild-type rhesus 
monkey CEA was previously described (WO 2004/072287, SEQ ID NOs: 33 and 34, FIGURES 7C-7D). 

"Wild-type CEA gene" refers to a gene comprising a sequence of nucleotides that 
20 encodes a naturally occurring CEA protein, including proteins of human origin or proteins obtained from 
another organism, including, but not limited to, other mammals such as rat, mouse and rhesus monkey. 
The nucleotide sequence of the human CEA gene is available in the art {supra). See also Beauchemin et 
al., Mol Cell. Biol. 7:3221-3230 (1987); Zimmerman et al., Proc. Natl. Acad. ScL USA 84:920-924 
(1987); and Thompson et al. Proc. Natl. Acad. ScL USA 84(9):2965-69 (1987). The nucleotide sequence 
25 of the wild-type rhesus monkey gene is shown in FIGURES 7A-7B. 

The term "mammalian" refers to any mammal, including a human being. 
The abbreviation "Ag" refers to an antigen. 

The abbreviations "Ab" and "mAb" refer to an antibody and a monoclonal antibody, 

respectively. 

30 The abbreviation "ORF" refers to the open reading frame of a gene. 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1. Panel A shows a schematic representation of the CEA fusion proteins used 
in this study. Vectors expressing the CEA fusion proteins were derived from plasmid pVUns as 
35 described in EXAMPLE 1. The constructs comprise a CEA nucleotide sequence from nt 1 to nt 2037 
with a net deletion of 64 aa corresponding to the GPI anchoring sequence and express CEA from aa 1 to 
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aa 679. The sequence coordinates of each protein fused to CEA are also indicated. Panel B shows 
expression of pVU-derived constructs in transfected cells. HeLa cells were transfected with plasmids 
pVU/CEA-VSV-G, pVU/CEA-FcIgG, pVU/CEA-DOM, pVU/CEA-HSP70, pVU/CEA-LAMP, or 
pVU/CEA and processed for Western blot analysis as described in EXAMPLE 4. The specificity of the 
5 antibody used for Western blot is indicated. The CEA protein is indicated (black arrow). The positions 
of molecular size standards (in kilodaltons) are also shown. 

| FIGURE 2 shows a comparison of expression efficiency of the CEA fusion constructs. 

HeLa cells were transfected with the indicated plasmids and CEA derived protein present in cell lysates 
(A) and supematants (B) was measured by ELISA as described in EXAMPLE 4. Results obtained are 
10 representative of two independent experiments. 

FIGURE 3 shows a comparison of immunogenicity of different constructs encoding CEA 
fusion proteins. C57BL/6 mice were electroporated intramuscularly with a 5 or 50 ug/dose of the 
indicated plasmids. Injections were carried out at days 0 and 14. Panel A The number of IFNy-secreting 
T cells in PBMC in each individual mouse was determined using a pool of peptides covering aa 497-703 
15 (pool D) as described in EXAMPLES 5 and 10. Average number of IFNY-secreting T cells are also 
shown (fdled circles). SFC values of the pVU/CEA-DOM, and pVU/CEA-FcIgG are significantly 
different from those of pVl J/CEA. Panel B. Antibody titer was measured by ELISA using purified CEA 
as substrate. Average values of each cohort immunized with 50 jxg dose of the indicated plasmid are 
shown. Titers that are significantly different from those of mice injected with pVl J/CEA are indicated 
20 with an asterisk. 

FIGURE 4 shows the induction of CEA-specific immune responses in CEA transgenic 
mice. Groups of 12 CEA transgenic mice were immunized with plasmid DNA (50 jig/dose 
electroinjected in the quadriceps muscle) or Adenovirus vectors (109 vp/dose) carrying the codon usage 
optimized cDNA of CEA, CEA-DOM or CEA-FcIgG. CEA-specific CD8 + T cells elicited by the 

25 DNA/DNA (A) and Ad/Ad (C) immunization regimen were measured by intracellular IFNy staining on 
PBMC of each immunized mouse. The average values for each cohort are also shown (filled circle). The 
CEA-DOM and CEA-FcIgG cohorts immunized with DNA/DNA and Ad/Ad regimens were significantly 
different from the CEA vaccinated group. CEA-specific antibody titers of each individual mouse 
vaccinated with the DNA/DNA (B) or Ad/Ad (D) immunization regimen were measured by ELISA. 

30 Titers elicited by CEA-DOM and CEA-FcIgG vectors were significantly different from those elicited by 
CEA. 

FIGURE 5 shows the immunogenicity of the DNA/Ad regimen. Groups of 12 CEA 
transgenic mice were immunized with plasmid DNA (50 Hg/dose) and Adenovirus vectors (109 vp/dose) 
carrying the codon usage optimized cDNA of CEA, CEA-DOM or CEA-FcIgG. CEA-specific CD8 + T 
35 cells were measured by intracellular IFNy staining on PBMC of each immunized mouse (A). The 
average values for each cohort are also shown (filled circle). The CEA-DOM and CEA-FcIgG cohorts 
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were significantly different from the CEA vaccinated group. CEA-specific antibody titers of each 
individual mouse were measured by ELISA (B). Titers elicited by CEA-DOM and CEA-FcIgG vectors 
were significantly different from those elicited by CEA. Average values are shown (filled circles). 

FIGURE 6 shows the detection of CD4 + T cell response to tetanus toxoid protein. CEA 

5 transgenic mice were immunized with pVlJ/CEA-DOMopt as described in EXAMPLE 11. IFNy 
intracellular staining on pooled PBMC from immunized mice was performed with peptide p30. Whole 
lymphocyte gating and gating for CD8 + (R3) and CD4 + T cells (R4) are shown. 

FIGURE 7 shows the antitumor effect of vaccination with vectors carrying codon 
optimized cDNA of CEA, CEA-DOM or CEA-FcIgG. Groups of 10 CEA transgenic mice were 

10 immunized were with DNA/DNA (A), Ad/Ad (B) and DNA/Ad (C) vaccination regimens using plasmid 
DNA and Ad vectors carrying the codon usage optimized cDNAs of CEA, CEA-DOM or CEA-FcIgG, as 
described in EXAMPLE 13. Two weeks after the last injection, mice were challenged with sc 
inoculation of 5x10$ MC38-CEA tumor cells. Percentage of tumor free mice in the vaccinated groups 
was determined at weekly intervals and compared to that of untreated controls. Mice vaccinated with 

15 CEA-DOM vectors (DNA/Ad modality) was significantly different from control mice (log rank test 
p<0.05). 

FIGURE 8 shows the effect of CD4, CD8, or NK depletion on the induction of anti- 
tumor effect induced by CEA-DOM DNA/Ad immunization. CEA transgenic mice were immunized 
with repeated weekly injections of 50 *ig of pVlJ/CEA-DOMopt followed by a boost with 1x109 vp of 
20 Ad-CEA-DOMopt (EXAMPLE 14). One week after the last injection, mice were either not depleted, or 
were depleted of CD4 + T cells, CD8 + T cells, or NK cells. Two weeks after the last immunization, mice 
were challenged with sc inoculation of 5x10$ MC38-CEA tumor cells. Percentage of tumor free mice in 
the vaccinated groups was determined at weekly intervals and compared to that of untreated controls. 
The data indicate that the percentage of tumor-free mice in the vaccinated group was significantly 
25 different from untreated controls and depleted cohorts. 

FIGURE 9, shows the nucleotide sequence (SEQ ID NO:l) of an exemplary, fully 
optimized hCEA-DOM fusion. The amino acid sequence of the encoded protein is also shown (SEQ ID 
NO:25). The CEA portion of the nucleotide sequence of this particular CEA fusion consists of 
nucleotides 1 to 2037, which are codon-optimized for high-level expression in a human host cell. The 
30 DOM portion of the nucleotide sequence is shown in bold and is also codon-optimized for high-level 
expression in human cells. Junction sequences, created by the cloning strategy employed to fuse the 
CEA and LTB sequences are underlined. 

FIGURE 10 shows an exemplary nucleotide (SEQ ID NO:5) sequence of a hCEA- 
FcIgGopt fusion. The sequence of the encoded protein (SEQ ID NO:26) is also shown. The CEA 
35 portion of the nucleotide sequence of this particular CEA fusion consists of nucleotides 1 to 2037, which 

are codon-optimized for high-level expression in a human host cell. The FcIgG portion of the nucleotide 
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sequence, which is also codon-optimized for high-level expression in human cells, is shown in bold. 
Junction sequences, created by the cloning strategy employed to fuse the CEA and LTB sequences are 
underlined. 

FIGURE 11 shows the nucleotide sequence of a portion of the wild-type human CEA 
5 cDNA from nt 1 to nt 2037 (SEQ ID NO:2, Panel A), encoding a portion of the hCEA protein from aa 1 
to aa 679 (SEQ ID NO:3, Panel B). 

FIGURE 12 shows the non-optimized nucleotide sequence of the minimized domain of 
tetanus toxin fragment C (DOM) cDNA from nt 1 to nt 825 (SEQ ID NO:27), encoding the DOM 
protein, also shown (SEQ ID NO:28). 
10 FIGURE 13 shows the non-optimized nucleotide sequence of an exemplary hCEA-DOM 

fusion (SEQ ID NO:29). The CEA portion of the nucleotide sequence of this particular CEA fusion 
consists of nucleotides 1 to 2037. The DOM portion of the nucleotide sequence is shown in bold. 

FIGURE 14 shows the shows nucleotide sequences of wild-type genes encoding rhesus 
monkey CEA (Panels A and B, SEQ ID NOs:31 and 32) and the amino acid sequences of the 
15 corresponding proteins (Panels C and D, SEQ ID NOs:33 and 34), as previously described (WO 
2004/072287). Panel E shows the amino acid sequence of wild-type human CEA (SEQ ID NO.30), 
which was previously described {see, e.g., U.S. Patent No. 5,274,087). 

FIGURE 15 shows an exemplary nucleotide sequence (SEQ ID NO:35) of a rhesus 
monkey CEA- DOM fusion, herein designated rhCEA-DOMopt. The sequence of the encoded fusion 
20 protein (SEQ ID NO:36) is also shown. The CEA portion of the nucleotide sequence of this particular 
CEA fusion consists of nucleotides 1 to 2037, which are codon-optimized for high-level expression in a 
human host cell. The DOM portion of the nucleotide sequence, which is also codon-optimized for high- 
level expression in human cells, is shown in bold. 

FIGURE 16 shows an exemplary nucleotide sequence (SEQ ID NO:37) of a rhesus 
25 monkey CEA- CTB fusion, herein designated rhCEA-CTBopt. The sequence of the encoded fusion 
protein (SEQ ID NO:38) is also shown. The CEA portion of the nucleotide sequence of this particular 
CEA fusion consists of nucleotides 1 to 2037, which are codon-optimized for high-level expression in a 
human host cell. The CTB portion of the nucleotide sequence, which is also codon-optimized for high- 
level expression in human cells, is shown in bold. 

30 

DETAILED DESCRIPTION OF THE INVENTION 

Carcinoembryonic antigen (CEA) is commonly associated with the development of 
adenocarcinomas. The present invention relates to compositions and methods to elicit or enhance 
immunity to the protein product expressed by the CEA tumor-associated antigen, wherein aberrant CEA 
35 expression is associated with the carcinoma or its development. Association of aberrant CEA expression 
with a carcinoma does not require that the CEA protein be expressed in tumor tissue at all timepoints of 
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its development, as abnormal CEA expression may be present at tumor initiation and not be detectable 
late into tumor progression or vice-versa. 

To this end, the present invention provides polynucleotides, vectors, host cells, and 
encoded proteins comprising a CEA sequence or variant thereof for use in vaccines and pharmaceutical 
" 5 "compositions for the treatment and/or prevention of a cancer. The polynucleotides of the present 
invention comprise a nucleotide sequence encoding a CEA protein or variant thereof, fused to a 
nucleotide sequence encoding at least a subunit of an immunoenhancing element, such as a bacterial 
enterotoxin or substantial portion thereof, which can effectively adjuvant an immune response to the 
associated CEA. 

10 The CEA nucleotide sequences of the present invention can be of human origin or can be 

a CEA homolog from another species. The wild-type human CEA nucleotide sequence has been reported 
{see eg U.S. Patent No. 5,274,087; U.S. Patent No 5,571,710; and U.S. Patent No 5,843,761). The 
rhesus monkey CEA sequence was recently described (WO 2004/072287). The CEA portion of the CEA 
fusion may be full-length, or any variant sufficient to elicit a CEA-specific immune response in a 

15 mammal. CEA variants of the present invention include, but are not limited to, sequences that are C- or 
N-terminally truncated, sequences with conservative substitutions, and sequences with internal deletions 
or insertions. 

In preferred embodiments of the present invention, the CEA portion of the CEA fusion is 
human CEA (SEQ ID NO:30) or a functional equivalent thereof, for example, a human CEA deleted of 
20 its anchoring domain (SEQ. ID NO:3). In other preferred embodiments, the CEA portion is a rhesus 
monkey CEA (SEQ ID NOs:33 and 34), or functional equivalent thereof. 

. Accordingly, the present invention relates to a synthetic polynucleotide comprising a 
sequence of nucleotides encoding a CEA fusion protein, said fusion protein comprising a CEA protein or 
a biologically active fragment or mutant form of a CEA protein fused to an immunoenhancing element or 
25 substantial portion thereof, which can effectively enhance the immune response to the CEA protein. Said 
mutant forms of the CEA protein include, but are not limited to: conservative amino acid substitutions, 
amino-terminal truncations, carboxy-terminal truncations, deletions, or additions. Any such biologically 
active fragment and/or mutant will encode either a protein or protein fragment which at least 
substantially mimics the immunological properties of the CEA protein as set forth in SEQ ID NO:30. 
30 The synthetic polynucleotides of the present invention encode mRNA molecules that express a functional 
CEA fusion protein so as to be useful in the development of a therapeutic or prophylactic cancer vaccine. 

In preferred embodiments, the CEA portion of the encoded CEA fusion protein is deleted 
of its C-terminal anchoring domain (AD) , which is located from about amino acid 679 to about amino 
acid 702 of the human full-length CEA (SEQ ID NO:30) herein designated CEAAAD, as set forth in SEQ 
35 ID NO:3 and shown in FIGURE 11. While not being bound by theory, deletion of the anchoring domain 
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increases secretion of the CEA fusion protein, thereby enhancing cross priming of the CEA-LTB immune 
response. 

The immunoenhancing element portion of the CEA fusion proteins of the present 
invention are capable of stimulating or enhancing the immune response to the associated CEA protein 
5 and are selected from the group consisting of: heat shock protein (HSP) 70, lysosome-associated 
membrane protein (LAMP), fragment C of tetanus toxoid (FrC), the N-terminal domain of FrC (DOM), 
the heavy fragment of constant chain of immune globulin Gl'(FcIgG), the vesicular stomatitis virus 
glycoprotein (VSV-G), cholera toxin (CT) from Vibrio cholerae, and heat labile enterotoxin of E.coli 
(LT). In preferred embodiments of the present invention, the adjuvant portion of the CEA fusion 
10 comprises the N-terminal domain of FrC (DOM), which has been shown to strongly potentiate the 
immunogenicity of codelivered antigens. In further preferred embodiments, the adjuvant portion of the 
CEA fusion is a substantial portion of FcIgG. 

A CEA fusion comprising a truncated human CEA fused to a single epitope of tetanus 
toxin (Q830 - L844) has been described (Lund et al. Cancer Gene Therapy 10: 365-376 (2003)). Unlike 
15 this single-epitope fusion, the CEA fusions of the present invention comprise a substantial portion of an 
immunoenhancing element or subunit thereof, as described above, which is capable of enhancing the 
immunogenicity of a CEA protein or variant thereof. A substantial portion of an immunoenhancing 
element to be used for the compositions and methods described herein does not include portions that are 
less than 50% of a full-length immunoenhancing element or subunit thereof. The strategy used herein, 
20 which utilizes full-length adjuvant subunits or substantial portions thereof, was employed to ensure a 
greater immune response to the fused CEA sequence. While not being bound by theory, it is believed 
that if the immunoenhancing element chosen as adjuvant comprises greater than one helper epitope, 
limiting the adjuvant sequence of the fusion protein to a single epitope would arguably lead to a reduced 
effect on the immunogenicity of the target protein. Additionally, it is believed that if the adjuvant- 
25 mediated enhancement of the immune response is dependent on the interaction of the immunoenhancing 
element with specific cell receptors and not based on a universal epitope, then the receptor interaction 
could depend on a specific structural configuration that would require a substantial portion of the 
immunoenhancing element to exert an adjuvant effect. In such a case, a short adjuvant sequence 
comprising a single epitope would be insufficient in mediating an increase of the immune response. 
30 Also contemplated for use in the present invention are nucleotide sequences encoding 

variants or mutants of the immunoenhancing elements described herein, including, but not necessarily 
limited to: nucleotide substitutions, deletions, additions, amino-terminal truncations and carboxy- 
terminal truncations. In some cases, it may be advantageous to add specific point mutations to the 
nucleotide sequence encoding the adjuvant or subunit to reduce or eliminate toxicity of the encoded 
35 protein. 
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The immunoenhancing element, subunit, or substantial portion thereof may be fused to 
the amino terminus or the carboxy terminus of the CEA sequence. Further, the immunoenhancing 
element sequence and the CEA sequence can be fused N-terminus to N-terminus, C-terminus to C- 
terrninus, C-terminus .to. N-terminus or N-termmus to N-terminus. In preferred embodhnents of the 
5 present invention, the C-terminus of the CEA polypeptide is fused to the N-terminus of the 

immunoenhancing element. 

The present invention relates to a synthetic nucleic acid molecule (polynucleotide) 
comprising a sequence of nucleotides which encodes mRNA that expresses a novel CEA fusion protein; 
for example, nucleotide sequences encoding the fusion proteins as set forth in SEQ ID NOs:25, 26, 36 
10 and 38. The nucleic acid molecules of the present invention are substantially free from other nucleic 
acids. 

The present invention also relates to recombinant vectors and recombinant host cells, 
both prokaryotic and eukaryotic, which contain the nucleic acid molecules disclosed throughout this 
specification. The synthetic DNA molecules, associated vectors, and hosts of the present invention are 
15 useful for the development of a cancer vaccine. 

Exemplary nucleic acid molecules of the present invention comprise a nucleotide 
sequence selected from the group consisting of: SEQ ID NOs: 1, 5, 29, 35, and 37, as shown in FIGURES 
9, 10, 13, 15, and 16, which encode exemplary CEA fusion proteins of the present invention. 

The present invention also includes biologically active fragments or mutants of SEQ ID 
20 NOs: 1, 5, 29, 35, and 37, which encode mRNA expressing exemplary CEA fusion proteins. Any such 
. biologically active fragment and/or mutant will encode either a protein or protein fragment which at least 
substantially mimics the pharmacological properties of the hCEA protein, including but not limited to the 
hCEA protein as set forth in SEQ ID NO.30. Any such polynucleotide includes but is not necessarily 
limited to: nucleotide substitutions, deletions, additions, amino-terminal truncations and carboxy- 
25 terminal truncations. The mutations of the present invention encode mRNA molecules that express a 
functional CEA fusion protein in a eukaryotic cell so as to be useful in cancer vaccine development. 

Also included within the scope of this invention are mutations in the DNA sequence that 
do not substantially alter the ultimate physical properties of the expressed protein. For example, 
substitution of valine for leucine, arginine for lysine, or asparagine for glutamine may not cause a change 
30 in the functionality of the polypeptide. 

As stated above, the present invention further relates to recombinant vectors that 
comprise the nucleic acid molecules disclosed throughout this specification. These vectors may be 
comprised of DNA or RNA. For most cloning purposes, DNA vectors are preferred. Typical vectors 
include plasmids, modified viruses, baculovirus, bacteriophage, cosmids, yeast artificial chromosomes, 
35 and other forms of episomal or integrated DNA that can encode a CEA fusion protein. It is well within 
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the purview of the skilled artisan to determine an appropriate vector for a particular gene transfer or other 



use. 



Also provided by the present invention are purified CEA fusion proteins encoded by the 
nucleic acids disclosed throughout this specification. In exemplary embodiments of this aspect of the 
5 invention, the CEA fusion protein comprises a sequence of amino acids selected from the group 
consisting of: SEQ ID NOs: 25, 26, 36 and 38. 

Included in the present invention are DNA sequences that hybridize to the complement 
of SEQ ID NOs: 1, 5, 29, 35, and 37 under stringent conditions. By way of example, and not limitation, 
a procedure using conditions of high stringency is as follows. Prehybridization of filters containing DNA 
10 is carried out for about 2 hours to overnight at about 65°C in buffer composed of 6x SSC, 5x Denhardt's 
solution, and 100 /ig/ml denatured salmon sperm DNA. Filters are hybridized for about 12 to 48 hrs at 
65°C in prehybridization mixture containing 100 fig/ml denatured salmon sperm DNA and 5-20 x 10& 
cpm of 32 P _iabeled probe. Washing of filters is done at 37°C for about 1 hour in a solution containing 
2x SSC, 0.1% SDS. This is followed by a wash in O.lx SSC, 0.1% SDS at 50°C for 45 minutes before 
15 autoradiography. Other procedures using conditions of high stringency would include either a 
hybridization step carried out in 5x SSC, 5x Denhardt's solution, 50% formamide at about 42°C for 
about 12 to 48 hours or a washing step carried out in 0.2x SSPE, 0.2% SDS at about 65°C for about 30 to 
60 minutes. Reagents mentioned in the foregoing procedures for carrying out high stringency 
hybridization are well known in the art. Details of the composition of these reagents can be found in 
20 Sambrook et al., Molecular Cloning: A Laboratory Manual 2 nd Edition; Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, New York, (1989) or Sambrook and Russell, Molecular Cloning: A 
Laboratory Manual, 3rd Edition. Cold Spring Harbor Laboratory Press, Plainview, NY (2001). In 
addition to the foregoing, other conditions of high stringency which may be used are also well known in 
the art. 

25 An expression vector containing a CEA fusion protein-encoding nucleic acid molecule 

may be used for high-level expression of CEA fusion protein in a recombinant host cell. Expression 
vectors may include, but are not limited to, cloning vectors, modified cloning vectors, specifically 
designed plasmids or viruses. Also, a variety of bacterial expression vectors may be used to express 
recombinant CEA fusion sequences in bacterial cells if desired. In addition, a variety of fungal cell 
30 expression vectors may be used to express recombinant CEA fusion sequences in fungal cells. Further, a 
variety of insect cell expression vectors may be used to express recombinant protein in insect cells. 

The present invention also relates to host cells transformed or transfected with vectors 
comprising the nucleic acid molecules of the present invention. Recombinant host cells may be 
prokaryotic or eukaryotic, including but not limited to, bacteria such as E. coli, fungal cells such as yeast, 
35 mammalian cells including, but not limited to, cell lines of bovine, porcine, monkey and rodent origin; 
and insect cells including but not limited to Drosophila and silkworm derived cell lines. Such 
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recombinant host cells can be cultured under suitable conditions to produce a CEA fusion protein or a 
biologically equivalent form. In a preferred embodiment of the present invention, the host cell is human. 
As defined herein, the term "host cell" is not intended to include a host cell in the body of a transgenic 
human being, human fetus, or human embryos. 

As noted above, an expression vector containing DNA encoding a CEA fusion protein 
may be used for expression of CEA fusion protein in a recombinant host cell. Therefore, another aspect 
of this invention is a process for expressing a CEA fusion protein in a recombinant host cell, comprising: 
(a) introducing a vector comprising a nucleic acid comprising a sequence of nucleotides that encodes a 
CEA fusion protein into a suitable human host cell, wherein the CEA fusion protein comprises a CEA 
protein or variant thereof, fused to a substantial portion of an immunoenhancing element or subunit 
thereof, wherein the immunoenhancing element or subunit thereof is selected from the group consisting 
of: heat shock protein (HSP) 70, lysosome-associated membrane protein (LAMP), fragment C of tetanus 
toxoid (FrC), the N-terminal domain of FrC (DOM), the heavy fragment of constant chain of immune 
globulin Gl (FcIgG), the vesicular stomatitis virus glycoprotein (VSV-G), cholera toxin (CT) from 
Vibrio cholerae, and heat labile enterotoxin of E.coli (LT); and wherein the fusion protein is capable of 
producing an immune response in a mammal; and, (b) culturing the host cell under conditions which 
allow expression of said CEA fusion protein. 

A preferred immunoenhancing element for use in this aspect of the invention is DOM. 

Another preferred immunoenhancing element is FcIgG. 

In a further preferred embodiment of this aspect of the invention, the nucleotide 
sequence of the CEA portion of the fusion and/or the immunoenhancing element portion of the fusion are 
codon-optimized for high-level expression in human cells. 

This invention also provides a process for expressing a CEA-DOM fusion protein in a 
recombinant host cell, comprising: (a) introducing a vector comprising a nucleic acid comprising a 
sequence of nucleotides that encodes a CEA-DOM fusion protein into a suitable human host cell, 
wherein the CEA fusion protein comprises a CEA protein or variant thereof, fused to a substantial 
portion of the N-terminal domain of fragment C of tetanus toxin (DOM), and wherein the fusion protein 
is capable of producing an immune response in a mammal; and, (b) culturing the host cell under 
conditions which allow expression of said CEA-DOM fusion protein. 

In preferred embodiments of the process for expressing a CEA-DOM fusion protein 
described above, the DOM portion is codon-optimized for high-level expression in human cells. In other 
preferred embodiments, the CEA portion of the CEA fusion is codon-optimized for high-level expression 
in human cells. In still further preferred embodiments, both the CEA and the DOM portions are codon- 
optimized for high-level expression in human cells. 

Following expression of a CEA fusion in a host cell, CEA fusion protein may be 
recovered to provide CEA fusion protein in active form. Several protein purification procedures are 
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available and suitable for use. Recombinant protein may be purified from cell lysates and extracts by 
various combinations of, or individual application of salt fractionation, ion exchange chromatography, 
size exclusion chromatography, hydroxylapatite adsorption chromatography and hydrophobic interaction 
chromatography. In addition, recombinant CEA fusion protein can be separated from other cellular 
proteins by use of an immunoaffinity column made with monoclonal or polyclonal antibodies specific for 
a CEA protein, or polypeptide fragments of a CEA protein. 

The nucleic acid molecules comprising CEA fusions and the encoded fusion proteins of 
this invention were designed to enhance the CEA-specific immune response, relative to full-length CEA 
cDNA, for use in vaccine development. To further enhance the immunogenic properties of the CEA 
fusion'sequences of the present invention, in some embodiments described herein, the polynucleotides 
encoding CEA fusion proteins comprise optimized codons for further high level expression in a host cell, 
as described below. In these embodiments, at least a portion of the codons of the CEA fusions are 
designed so as to use the codons preferred by the projected host cell, which in preferred embodiments is 
a human cell. The optimized CEA fusions may be used for the development of recombinant adenovirus 
or plasmid-based DNA vaccines, which provide effective immunoprophylaxis against CEA-associated 
cancer through neutralizing antibody and cell-mediated immunity. The synthetic molecules may be used 
as an immunogenic composition. This invention provides codon-optimized CEA fusion polynucleotides 
which, when directly introduced into a vertebrate in vivo, including mammals such as primates and 
humans, induce the expression of encoded proteins within the animal. 

As stated above, in some embodiments of the present invention, the synthetic molecules 
comprise a sequence of nucleotides, wherein some of the nucleotides have been altered so as to use the 
codons preferred by a human cell, thus allowing for high-level fusion protein expression in a human host 
cell. The synthetic molecules may be used as a source of a CEA fusion protein, for example, CEA-DOM 
fusion protein, which may be used in a cancer vaccine to provide effective immunoprophylaxis against 
CEA-associated carcinomas through neutralizing antibody and cell-mediated immunity. The nucleic acid 
molecules disclosed herein may also serve as the basis for a DNA-based cancer vaccine. 

A "triplet" codon of four possible nucleotide bases can exist in over 60 variant forms. 
Because these codons provide the message for only 20 different amino acids (as well as transcription 
initiation and termination), some amino acids can be coded for by more than one codon, a phenomenon 
known as codon redundancy. For reasons not completely understood, alternative codons are not 
uniformly present in the endogenous DNA of differing types of cells. Indeed, there appears to exist a 
variable natural hierarchy or "preference" for certain codons in certain types of cells. As one example, 
the amino acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CTT, TTA, 
and TTG. Exhaustive analysis of genome codon frequencies for microorganisms has revealed 
endogenous DNA of B. coli most commonly contains the CTG leucine-specifying codon, while the DNA 
of yeasts and slime molds most commonly includes a TTA leucine-specifying codon. In view of this 
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hierarchy, it is generally believed that the likelihood of obtaining high levels of expression of a leucine- 
rich polypeptide by an E. coli host will depend to some extent on the frequency of codon use. For 
example, it is likely that a gene rich in TTA codons will be poorly expressed in E. coli, whereas a CTG 
rich gene will probably be highly expressed in this host. Similarly, a preferred codon for expression of a 
5 leucine-rich polypeptide in yeast host cells would be TTA. 

The implications of codon preference phenomena on recombinant DNA techniques are 
manifest, and the phenomenon may serve to explain many prior failures to achieve high expression levels 
of exogenous genes in successfully transformed host organisms-a less "preferred" codon may be 
repeatedly present in the inserted gene and the host cell machinery for expression may not operate as 
10 efficiently. This phenomenon suggests that synthetic genes which have been designed to include a 
projected host cell's preferred codons provide an optimal form of foreign genetic material for practice of 
recombinant DNA techniques. Thus, one aspect of this invention is a CEA fusion gene that is codon- 
optimized for expression in a human cell. In a preferred embodiment of this invention, it has been found 
that the use of alternative codons encoding the same protein sequence may remove the constraints on 
15 expression of exogenous CEA fusion protein in human cells. 

In accordance with some embodiments of the present invention, the nucleic acid 
molecules which encode the CEA fusion proteins are converted to a polynucleotide sequence having an 
identical translated sequence but with alternative codon usage as described by Lathe, "Synthetic 
Oligonucleotide Probes Deduced from Amino Acid Sequence Data: Theoretical and Practical 
20 Considerations" J. Molec. Biol 183:1-12 (1985), which is hereby incorporated by reference. The 
methodology generally consists of identifying codons- in the wild-type sequence that are not commonly 
associated with highly expressed human genes and replacing them with optimal codons for high 
expression in human cells. The new gene sequence is then inspected for undesired sequences generated 
by these codon replacements (e.g., "ATTTA" sequences, inadvertent creation of intron splice recognition 
25 sites, unwanted restriction enzyme sites, etc.). Undesirable sequences are eliminated by substitution of 
the existing codons with different codons coding for the same amino acid. The synthetic gene segments 
are then tested for improved expression. 

The methods described above were used to create synthetic gene sequences which 
encode CEA fusion proteins of the present invention, resulting in a gene comprising codons optimized 
30 for high level expression. While the above procedure provides a summary of our methodology for 
designing codon-optimized genes for use in cancer vaccines, it is understood by one skilled in the art that 
similar vaccine efficacy or increased expression of genes may be achieved by minor variations in the 
procedure or by minor variations in the sequence. 

One of skill in the art will also recognize that additional nucleic acid molecules may be 
35 constructed that provide for high levels of CEA fusion expression in human cells, wherein only a portion 
of the codons of the DNA molecules are codon-optimized. For example, in some embodiments of the 
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present invention, codons comprising the CEA portion of the CEA fusion are optimized for high-level 
expression in human cells, and codons comprising the adjuvant portion of the CEA fusion are 
substantially similar to the wild-type adjuvant-encoding nucleotide sequence . In other embodiments of 
the present invention, codons comprising the adjuvant portion of the CEA fusion are optimized for high- 
level expression in human cells, and codons comprising the CEA portion of the CEA fusion are 
substantially similar to a wild-type CEA gene. In still other embodiments of the present invention, both 
the CEA and the adjuvant portions of the CEA fusion are codon-optimized for high-level expression in 
human cells. CEA fusions in which only a subset of codons are optimized within the CEA and/or the 
adjuvant portion of the CEA fusion are also contemplated by this invention. 

The nucleic acids of the present invention may be assembled into an expression cassette 
which comprises sequences designed to provide for efficient expression of the protein in a human cell. 
The cassette preferably contains CEA fusion protein-encoding gene, with related transcriptional and 
translations control sequences operatively linked to it, such as a promoter, and termination sequences. In 
a preferred embodiment, the promoter is the cytomegalovirus promoter without the intron A sequence 
(CMV), although those skilled in the art will recognize that any of a number of other known promoters 
such as the strong immunoglobulin, or other eukaryotic gene promoters may be used. A preferred 
transcriptional terminator is the bovine growth hormone terminator, although other known transcriptional 
terminators may also be used. The combination of CMV-BGH terminator is particularly preferred. 

In accordance with this invention, the CEA fusion expression cassette is inserted into a 
vector. The vector is preferably an adenoviral or plasmid vector, although linear DNA linked to a 
promoter, or other vectors, such as adeno-associated virus or a modified vaccinia virus, retroviral or 
lentiviral vector may also be used. 

If the vector chosen is an adenovirus, it is preferred that the vector be a so-called first- 
generation adenoviral vector. These adenoviral vectors are characterized by having a non-functional El 
gene region, and preferably a deleted adenoviral El gene region. In some embodiments, the expression 
cassette is inserted in the position where the adenoviral El gene is normally located. In addition, these 
vectors optionally have a non-functional or deleted E3 region. It is preferred that the adenovirus genome 
used be deleted of both the El and E3 regions (AE1AE3). The adenoviruses can be multiplied in known 
cell lines which express the viral El gene, such as 293 cells, or PERC.6 cells, or in cell lines derived 
from 293 or PERC.6 cell which are transiently or stablily transformed to express an extra protein. For 
examples, when using constructs that have a controlled gene expression, such as a tetracycline 
regulatable promoter system, the cell line may express components involved in the regulatory system. 
One example of such a cell line is T-Rex-293; others are known in the art. 

For convenience in manipulating the adenoviral vector, the adenovirus may be in a 
shuttle plasmid form. This invention is also directed to a shuttle plasmid vector which comprises a 
plasmid portion and an adenovirus portion, the adenovirus portion comprising an adenoviral genome 
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which has a deleted El and optional E3 deletion, and has an inserted expression cassette comprising a 
CEA fusion protein encoding nucleotide sequence. In preferred embodiments, there is a restriction site 
flanking the adenoviral portion of the plasmid so that the adenoviral vector can easily be removed. The 
shuttle plasmid may be replicated in prokaryotic cells or eukaryotic cells. 
5 In a preferred embodiment of the invention, the expression cassette is inserted into the 

pMRKAd5-HV0 adenovirus plasmid (.See Emini et al., WO 02/22080, which is hereby incorporated by 
reference). This plasmid comprises an Ad5 adenoviral genome deleted of the El and E3 regions. The 
design of the pMRKAd5-HV0 plasmid was improved over prior adenovectors by extending the 5' cis- 
acting packaging region further into the El gene to incorporate elements found to be important in 
10 optimizing viral packaging, resulting in enhanced virus amplification. Advantageously, this enhanced 
adenoviral vector is capable of maintaining genetic stability following high passage propagation. 

Standard techniques of molecular biology for preparing and purifying DNA constructs 
enable the preparation of the adenoviruses, shuttle plasmids, and DNA immunogens of this invention. 

It has been determined in accordance with the present invention that the CEA fusion 
15 protein-encoding molecules described herein (e.g. SEQ ID NO:l) are expressed with equivalent 
efficiency compared to the corresponding wild type CEA sequence (See EXAMPLE 4). It has also been 
shown herein that plasmids pVU/hCEA-DOM and pVU/hCEA-FcIgG elicited a greater CEA-specific 
cell-mediated and humoral immune response than CEA (See EXAMPLE 10). It has also been shown in 
accordance with the present invention that tolerance to the CEA self antigen can be broken more 
20 efficiently with the Dom and FcIgG CEA fusions described herein, relative to the full-length wild-type 
CEA cDNA, due to the increased immunogenic properties of the CEA fusions. The enhanced 
immunogenic properties of these fusion proteins was observed upon immunization with DNA or Ad 
vectors, indicating that enhanced immunogenicity of CEA-LT fusions is not limited to plasmid DNA 
immunization (see EXAMPLE 1 1). 
25 Therefore, the vectors described above may be used in immunogenic compositions and 

vaccines for preventing the development of adenocarcinomas associated with aberrant CEA expression 
and/or for treating existing cancers. The vectors of the present invention allow for vaccine development 
and commercialization by eliminating difficulties with obtaining high expression levels of exogenous 
CEA in successfully transformed host organisms and by providing a CEA fusion protein which can elicit 
30 an enhanced immune response when administered to a mammal such as a human being. 

To this end, one aspect of the instant invention is a method of preventing or treating 
CEA-associated cancer comprising administering to a mammal a vaccine vector comprising a 
polynucleotide comprising a sequence of nucleotides that encodes a CEA fusion protein, wherein the 
CEA fusion protein comprises a CEA protein or variant thereof, fused to a substantial portion of an 
35 immunoenhancing element selected from the group consisting of: heat shock protein (HSP) 70, 
lysosome-associated membrane protein (LAMP), fragment C of tetanus toxoid (FrC), the N-terminal 
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domain of FrC (DOM), the heavy fragment of constant chain of immune globulin Gl (FcIgG), the 
vesicular stomatitis virus glycoprotein (VSV-G), cholera toxin (CT) from Vibrio cholerae, and heat 
labile enterotoxin of Kcoli (LT); and wherein the fusion protein is capable of producing an immune 
response in a mammal. 

5 In preferred embodiments of the methods described herein, the Immunoenhancing 

element is DOM or FcIgG. 

In accordance with the method described above, the vaccine vector may be administered 
for the treatment or prevention of a cancer in any mammal, including but not limited to: lung cancer, 
breast cancer, and colorectal cancer. In a preferred embodiment of the invention, the mammal is a 
10 human. 

Further, one of skill in the art may choose any type of vector for use in the treatment and 
prevention method described. Preferably, the vector is an adenovirus vector or a plasmid vector. In a 
preferred embodiment of the invention, the vector is an adenoviral vector comprising an adenoviral 
genome with a deletion in the adenovirus El region, and an insert in the adenovirus El region, wherein 
15 the insert comprises an expression cassette comprising: (a) a sequence of nucleotides that encodes a CEA 
fusion protein, wherein the CEA fusion protein comprises a CEA protein or variant thereof, fused to a 
substantial portion of an immunoenhancing element or substantial portion thereof; wherein the 
immunoenhancing element is selected from the group consisting of: heat shock protein (HSP) 70, 
lysosome-associated membrane protein (LAMP), fragment C of tetanus toxoid (FrC), the N-terminal 
20 domain of FrC (DOM), the heavy fragment of constant chain of immune globulin Gl (FcIgG), the 
vesicular stomatitis virus glycoprotein (VSV-G), cholera toxin (CT) from Vibrio cholerae, and heat 
labile enterotoxin of Kcoli (LT); and wherein the fusion protein is capable of producing an immune 
response in a mammal; and (b) a promoter operably linked to the polynucleotide. 

The instant invention further relates to an adenovirus vaccine vector comprising an 
25 adenoviral genome with a deletion in the El region, and an insert in the El region, wherein the insert 
comprises an expression cassette comprising: (a) a sequence of nucleotides that encodes a CEA fusion 
protein, wherein the CEA fusion protein comprises a CEA protein or variant thereof, fused to a 
substantial portion of an immunoenhancing element or substantial portion thereof; wherein the 
immunoenhancing element is selected from the group consisting of: HSP70, LAMP, FrC, DOM, the 
30 FcIgG, VSV-G, CT and LT; and wherein the fusion protein is capable of producing an immune response 
in a mammal; and (b) a promoter operably linked to the polynucleotide. 

In a preferred embodiment of this aspect of the invention, the adenovirus vector is an Ad 



5 vector. 
35 vector. 



In another preferred embodiment of the invention, the adenovirus vector is an Ad 6 

In yet another preferred embodiment, the adenovirus vector is an Ad 24 vector. 

20 
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Also contemplated for use in the present invention is an adenovirus vaccine vector 
comprising a adenovirus genome that naturally infects a species other than human, including, but not 
limited to, chimpanzee adenoviral vectors. A preferred embodiment of this aspect of the invention is a 
chimp Ad 3 vaccine vector. 

5 In another aspect, the invention relates to a vaccine plasmid comprising a plasmid 

portion and an expression cassette portion, the expression cassette portion comprising: (a) a sequence of 
nucleotides that encodes a CEA fusion protein, wherein the CEA fusion protein comprises a CEA protein 
or variant thereof, fused to a substantial portion of an immunoenhancing element or substantial portion 
thereof, selected from the group consisting of: HSP70, LAMP, FrC, DOM, the FcIgG, VSV-G, CT and 
10 LT; and wherein the fusion protein is capable of producing an immune response in a mammal; and (b) a 
promoter operably linked to the polynucleotide. 

In some embodiments of this invention, the recombinant adenovirus and plasmid-based 
polynucleotide vaccines disclosed herein are used in various prime/boost combinations in order to induce 
an enhanced immune response. In this case, the two vectors are administered in a "prime and boost" 
15 regimen. For example the first type of vector is administered one or more times, then after a 
predetermined amount of time, for example, 2 weeks, 1 month, 2 months, six months, or other 
appropriate interval, a second type of vector is administered one or more times. Preferably the vectors 
carry expression cassettes encoding the same polynucleotide or combination of polynucleotides. In the 
embodiment where a plasmid DNA is also used, it is preferred that the vector contain one or more 
20 promoters recognized by mammalian or insect cells. In a preferred embodiment, the plasmid would 
contain a strong promoter such as, but not limited to, the CMV promoter. The synthetic CEA fusion 
gene or other gene to be expressed would be linked to such a promoter. An example of such a plasmid 
would be the mammalian expression plasmid VUns as described (J. Shiver et. al. in DNA Vaccines, M. 
Liu et al. eds., N.Y. Acad. Sci., N.Y., 772:198-208 (1996), which is herein incorporated by reference). 
25 As stated above, an adenoviral vector vaccine and a plasmid vaccine may be 

administered to a vertebrate as part of a single therapeutic regime to induce an immune response. To this 
end, the present invention relates to a method of protecting a mammal from a CEA-associated cancer 
comprising: (a) introducing into the mammal a first vector comprising: i) a sequence of nucleotides that 
encodes a CEA fusion protein, wherein the CEA fusion protein comprises a CEA protein or variant 
30 thereof, fused to a substantial portion of an immunoenhancing element selected from the group consisting 
of: HSP70, LAMP, FrC, DOM, the FcIgG, VSV-G, CT and LT; and wherein the fusion protein is 
capable of producing an immune response in a mammal; and ii) a promoter operably linked to the 
polynucleotide; (b) allowing a predetermined amount of time to pass; and (c) introducing into the 
mammal a second vector comprising: i) a sequence of nucleotides that encodes a CEA fusion protein, 
35 wherein the CEA fusion protein comprises a CEA protein or variant thereof, fused to a substantial 
portion of an immunoenhancing element selected from the group consisting of: HSP70, LAMP, FrC, 
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DOM, the FcIgG, VSV-G, CT and LT; and wherein the fusion protein is capable of producing an 
immune response in a mammal; and ii) a promoter operably linked to the polynucleotide. 

In one embodiment of the method of protection described above, the first vector is a 
plasmid and the second vector is an adenovirus vector. .Jn an alternative embodiment, the first vector is 
5 an adenovirus vector and the second vectpr is a plasmid. 

In the method described above, the first type of vector may be administered more than 
once, with each administration of the vector separated by a predetermined amount of time. Such a series 
of administration of the first type of vector may be followed by administration of a second type of vector 
one or more times, after a predetermined amount of time has passed. Similar to treatment with the first 
10 type of vector, the second type of vector may also be given one time or more than once, following 
predetermined intervals of time. 

The instant invention further relates to a method of treating a mammal suffering from a 
CEA-associated adenocarcinoma comprising: (a) introducing into the mammal a first vector comprising: 
i) a sequence of nucleotides that encodes a CEA fusion protein, wherein the CEA fusion protein 
15 comprises a CEA protein or variant thereof, fused to a substantial portion of an immunoenhancing 
element selected from the group consisting of: HSP70, LAMP, FrC, DOM, the FcIgG, VSV-G, CT ind 
LT; and wherein the fusion protein is capable of producing an immune response in a mammal; and ii) a 
promoter operably linked to the polynucleotide; (b) allowing a predetermined amount of time to pass; 
and (c) introducing into the mammal a second vector comprising: i) a sequence of nucleotides that 
20 encodes a CEA fusion protein, wherein the CEA fusion protein comprises a CEA protein or variant 
thereof, fused to a substantial portion of an immunoenhancing element selected from the group consisting 
of: HSP70, LAMP, FrC, DOM, the FcIgG, VSV-G, CT and LT; and wherein the fusion protein is 
capable of producing an immune response in a mammal; and ii) a promoter operably linked to the 
polynucleotide. 

25 In one embodiment of the method of treatment described above, the first vector is a 

plasmid and the second vector is an adenovirus vector. In an alternative embodiment, the first vector is 

an adenovirus vector and the second vector is a plasmid. 

In preferred embodiments of the methods described above, the vectors comprise a 

sequence of nucleotides that encode a CEA-DOM fusion protein, wherein the CEA fusion protein 
30 comprises a CEA protein or variant thereof, fused to a substantial portion of a DOM subunit. In further 

preferred embodiments, the vector comprises a sequence of nucleotides that encodes a CEA-FcIgG 

fusion protein. 

The amount of expressible DNA or transcribed RNA to be introduced into a vaccine 
recipient will depend partially on the strength of the promoters used and on the immunogenicity of the 
35 expressed gene product. In general, an immunologically or prophylactically effective dose of about 1 ng 
to 100 mg and preferably about 10 \ig to 300 \ig of a plasmid vaccine vector is administered directly into 
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muscle tissue. An effective dose for recombinant adenovirus is approximately 106 — 10*2 particles and 
preferably about 10? — 10* Iparticles. Subcutaneous injection, intradermal introduction, impression 
though the skin, and other modes of administration such as intraperitoneal, intravenous, intramuscular or 
inhalation delivery are also contemplated. 
5 In preferred embodiments of the present invention, the vaccine vectors are introduced to 

the recipient through intramuscular injection. 

It may be desirable for the vaccine vectors of the present invention to be in a 
physiologically acceptable solution, such as, but not limited to, sterile saline or sterile buffered saline. 
Alternatively, it may be advantageous to administer an agent which assists in the cellular uptake of DNA, 
10 such as, but not limited to calcium ion. These agents are generally referred to as transfection facilitating 
reagents and pharmaceutically acceptable carriers. Those of skill in the art will be able to determine the 
particular reagent or pharmaceutically acceptable carrier as well as the appropriate time and mode of 
administration. 

All publications mentioned herein are incorporated by reference for the purpose of 
15 describing and disclosing methodologies and materials that might be used in connection with the present 
invention. Nothing herein is to be construed as an admission that the invention is not entitled to antedate 
such disclosure by virtue of prior invention. 

Having described preferred embodiments of the invention with reference to the 
accompanying drawings, it is to be understood that the invention is not limited to those precise 
20 embodiments, and that various changes and modifications may be effected therein by one skilled in the 
art without departing from the scope or spirit of the invention as defined in the appended claims. 
The following examples illustrate, but do not limit the invention. 

EXAMPLE 1 

25 Plasmid constructs. 

pVU/CEA op t and pVU/CEA : These two constructs carry the codon usage optimized 

and wild type cDNA of CEA, respectively. The CEA coding sequence is located between the 
CMV/intronA immediate early promoter of cytomegalovirus and the bovine growth hormone 
polyadenylation signal. All the constructs encoding CEA fusion proteins were generated by fusing the 

30 CEA cDNA from nt 1 to nt 2037 (SEQ ID NO:2), corresponding to aa 1 to aa 679 (SEQ ID NO: 3), with 
the cDNA fragment corresponding to the following: fragment C of tetanus toxoid (CEA-FrC, SEQ ID 
NO:4), the N-terminal domain of FrC (CEA-DOM, SEQ ID NO: 1), the heavy fragment of constant chain 
of immune globulin Gl (CEA-FcIgG, SEQ ID NO:5), the lysosome-associated membrane protein (CEA- 
LAMP, SEQ ID NO:6), the heat shock protein 70 (CEA-HSP70, SEQ ID NO:7), or the vesicular 

35 stomatitis virus glycoprotein (CEA-VSV-G, SEQ ID NO:8). 
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pVU/CEA-FrC. P V1 J/CEA-D OM pVU/C EA-FcIgG. pVU/CEA-LAMP, pVU/CEA- 
HSP70 and pV 1 J/CEA-VSV-G : FrC and DOM coding sequences were obtained by PCR amplification 
from pRep-TeT.C plasmid as described in Rice et al. (J. Immunol. 169: 3908-13 (2002)). FcIgG was 
obtained from total RNA of human PBMC. VSV-G and HSP70 were obtained from p-FAST-VSV-G and 
5 from plasmid pY3111, respectively. LAMP1 was obtained by gene assembly. Amplifications were 
carried out using the following primers: FrC sense (5'-T AT T C TA GATTCAACACCAATT 
CCATTTTCTTATTC-3' (SEQ ED NO:9>FrC antisense (5*-G CGGCCGCTAGAATCA 
TTTGTCCATCCTTCATC-3' (SEQ ID NO.10), DOM sense (5'-T AT TC TA G AT T C A 
ACACCAATTCCATTTTCTTATTC-3' (SEQ ID NO: 1 1) DOM antisense (5- T T A G C 
10 GGCCGCTAGTTCTGTATCATATGGTAAAGGG-3' (SEQ ID NO:12), FcIgG 
sense (5'-T C TA GATAAAACTCACACATGCCCA-3' (SEQ ID NO:13) FcIgG antisense 
(5'-G CCGACTCATTTACCCGGAGACAGGGAG-3' (SEQ ID NO:14), LAMP sense 
(5-T CTAGATTTGATCCCCATTGCTGTGGGCGGTGCCCTG-3' (SEQ ID 
NO:15) LAMP antisense (5'- GGCGTGACTCCTCTTCCTGCCAATGAGGTAGGC 
15 A A T G A G -3' (SEQ ID NO:16), VSV-G sense (5'-A TATC TA GATTTCACCATAGTTT 
TTCCACACAACC-3' (SEQ ID NO:17) VSV-G antisense (5'-G CGGCCGCCTTCCTT 
CCAAGTCGGTTCATCTCTATG-3' (SEQ ID NO: 18), HSP70 sense (5'-G CTCTAGA 
TATGGCT.CGTGCGGTCGGGATCGACC-3' (SEQ ID NO:19)) and HSP70 antisense 
(5'G CCGCGGCCGCTCACTTOGCCTCCCGGCCGTCGTCO-3* (SEQ ID 
20 NO:20). The amplified DNA was introduced at the 3' end of the CEA coding sequence generating 
plasmids pVU/CEA-FrC, pVU/CEA-DOM, pV 1 J/CEA-FcIgG, pVU/CEA-LAMP, P V1J/CEA-HSP70 
and pVl J/CEA-VSV-G. 

pVU/CEA-DOMopt and pV 1 J/CEA-FcIeGopt : The codon usage optimized cDNA of 
DOM and FcIgG were synthesized by oligonucleotide assembly (Geneart GmbH, Regensburg, Germany) 
25 and cloned in pCR-script vector (Stratagene, La Jolla, CA). To generate pVU/CEA-DOMopt, DOM op t 
was amplified by PCR using the following primers: Domopt sense (5 - GT T A TCTA G A A G C A C 
CCCCATCCC -3' (SEQ ID NO:21)) and Domopt reverse (5'- TTAAGATCTCTAAGAT 
CTGGTGTCGTATCTCAGGGG-3' (SEQ ID NO:22). The amplified product was then 
inserted into the XbaVBgUl sites of plasmid pVU/CEAopt. To generate pVlJ/CEA-FdgGopt, FcIgG op t 
30 was amplified by PCR using the following primers: FcIgG op t sense (5'-T T A TCTA GAAAGACC 
CACACCTGCCCCCCTTGC-3* (SEQ ID NO:23))and as FcIgGopt reverse (5'- T A T A G A 
TCTTAGGGTACCTTACTTGCCGGGG-3' (SEQ ID NO:24)) the amplified product was 
inserted into XbaVBglB. sites of plasmid pVU/CEAopt. 

35 EXAMPLE 2 

Adenovirus Vectors. 

24 
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Ad5/CEAo pt : Plasmid pCR-CEAopt was digested with EcoRI. The resulting 2156 bp 
insert was purified and cloned into the EcoKL of the polyMRK~Ad5 shuttle plasmid. 

Ad5/CEA : The shuttle plasmid pMRK-CEA for generation of Ad5 vector was obtained 
by digesting plasmid pDeltalsplB/CEA with Sspl and EcoKV. The 9.52 kb fragment was then ligated to 
5 a 1272 bp £gZII/#amHI-restricted, Klenow -treated product from plasmid polyMRK. A PacVStul 
fragment from pMRK-CEA and pMRK-CEAopt containing the expression cassette for CEA and El 
flanking Ad5 regions was recombined to Clal linearized plasmid pAd5 in BJ5183 E. coli cells. The 
resulting plasmids were pAd5-CEA and pAd5-CEAopt, respectively. Both plasmids were cut with Pad 
to release the Ad ITRs and transfected in PerC-6 cells. Amplification of Ad5 vectors was carried out by 
10 serial passage. MRKAd5/CEA and MRKAd5/CEAopt or fusion were purified through standard CsCl 
gradient purification and extensively dialyzed against A105 buffer (5mM Tris-Cl pH 8.0, ImM MgCl2, 

75 mM NaCl, 5% Sucrose, 0.005 Tween 20). 

Ad5/CEA-DOMont and Ad5/CEA-FdgGor>t : Plasmid pMRK-CEA-DOMopt and 
pMRK-CEA-FdgGopt were constructed by cutting polyMRK-Ad5 shuttle plasmid with Swal and by 
15 ligating the linearized vector with the 2.9 kb DNA fragment derived from pVU/CEA-DOMopt or 

ligating the linearized vector with the 2700 bp DNA fragment derived from pVU/CEA-FdgGlopt that 
had been restricted with EcdBl, BglH and treated with Klenow. pMRK-CEA-FcIgGopt and pMRK-CEA- 
DOMopt were linearized and recombined into the Ad genome as indicated above. 

20 EXAMPLE 3 

Construction of CEA fusion proteins. 

To determine the immunogenicity of CEA fusion proteins, a series of vectors were 
constructed encoding aminoacids 1 to 679 of the human CEA protein fused to a panel of selected 
polypeptides (see EXAMPLE 1). These sequences were chosen in view of their reported 

25 immunoenhancing properties, which have been demonstrated in a variety of experimental systems. The 
CEA fusions were engineered by joining the cDNA of the CEA protein deleted of the GPI anchoring 
sequence to the foreign polypeptides (FIGURE 1 A). The tumor antigen was linked to the HSP70, FcIgG 
or LAMP sequences to determine whether enhancement of antigen uptake or retargeting to the 
endosomal compartment would lead to an increased immune response. Similarly, fusion to fragment C 

30 of tetanus toxin (FrC) or to a minimal domain devoid of potentially competitive MHC class I binding 

epitopes (DOM) (Rice et al. J. Immunol 169: 3908-13 (2002)) was constructed to promote humoral and 
CD4 + T cell responses. Lastly, CEA was linked to the VSV-G coding sequence to determine whether 
fusion to a viral glycoprotein would influence the immunogenic properties of CEA. 

The coding sequences of the CEA-fusions were cloned into vectors pVUns under the 

35 control of the human CMV/intron A promoter plus the bovine growth hormone (BGH) polyadenylation 
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signal (EXAMPLE 1). Plasmids pVU/CEA-FRC, pVU/CEA-DOM, pVU/CEA-FcIgG, pVU/CEA- 
LAMP, pVU/CEA-VSV-G, and pVU/CEA-HSP70 carry the wild type cDNA of CEA fused to the 
coding sequences of the indicated foreign polypeptides. 

5 EXAMPLE 4 

Detection of CEA expression. 

CEA expression was monitored by ELISA and Western blot analysis. Expression of the 
fusion proteins in transfected cells was examined by Western blot analysis using antibodies specific for 
CEA, VSV-G, FcIgG, tetanus toxin, or HSP70. HeLa cells were either transfected with the indicated 

10 plasmid or infected with the selected Ad vector. After 48 hr incubation, whole cell lysates and culture 
supernatant were harvested. 

CEA expression in cell lysate or supernatant was also monitored using the Direct Elisa 
CEA Kit (DBC-Diagnostics Biochem Canada Inc). CEA protein was detected with the antibody specific 
for the fused polypeptide in transfected cell lysates, whereas no expression of the target antigen was 

15 observed in the mock transfected control samples (FIGURE IB). The molecular mass of the fusion 
proteins did not differ significantly from that of CEA. This apparent lack of difference in molecular 
mass between the various CEA polypeptides is probably due to the high degree of glycosylation of the 
tumor antigen. 

To compare the efficiency of expression of the vectors encoding the CEA-fusions to that 
20 of pVU/CEA, HeLa cells were transfected with the different plasmids and CEA expression of these 

constructs was determined two days post transfection by ELISA. Plasmids pVU/CEA-FrC, pVU/CEA- 
DOM, pVU/CEA-FdgG, pVU/CEA-LAMP, pVU/CEA-VSV-G and pVU/CEA-HSP70 expressed CEA 
with comparable efficiency as pVU/CEA (FIGURE 2A). Most of the fusion proteins were secreted and 
detected in the cell supernatant; however, CEA-LAMP was not released from the transfected cells, 
25 probably due to its re-routing to the lysosomal compartment. (FIGURE 2B). Thus, these results indicate 
that the cDNA encoding the CEA-FrC, CEA-DOM, CEA-VSV-G, CEA-FcIgG, CEA-HSP70 and CEA- 
LAMP fusion proteins are expressed with equivalent efficiency to that of the cDNA encoding the full 
length CEA protein. 

30 EXAMPLE 5 

Peptides. 

Lyophilized CEA peptides were purchased from Bio-Synthesis and resuspended in 
DMSO at 40 mg/ml. Pools of peptides of 15 aa overlapping by 1 1 residues were assembled as described 
(Facciabene et al. J. ViroL 78: 8663-72 (2004)). Final concentrations were the following: pool A = L2 
35 mg/ml, pool B 0.89 mg/ml, pool C 0.89 mg/ml, pool D 0.8 mg/ml. Immune response to DOM was 
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monitored using the tetanus toxoid peptide p30 (F947NNFTVSFWLRVPKVSASHLE967 (SEQID 
NO:25)) (Rice et al. J. Immunol. 167: 1558-65 (2001)). Peptides were stored at -80°C. 

EXAMPLE 6 

5 Mice immunization and tumor challenge. 

All animal studies were approved by the IRBM institutional animal care and use 
committee. Female C57BL/6 mice (H-2°) were purchased from Charles River (Lecco, Italy) CEA 
transgenic mice (H-2b) were provided by J. Primus (Vanderbilt University) and kept in standard 
conditions (Clarke et al. Cancer Res. 58:1469-77 (1998)). 

10 C57BL/6 mice were subjected to two DNA injections in quadriceps muscle followed by 

electrical stimulation as previously described (Rizzuto et al. Proc. Natl. Acad. Sci. USA 96 (11): 6417-22 
(1999)). Injections were carried out at three-week intervals. CEA transgenic mice were subjected to 
either 5 weekly injections of plasmid DNA (50ng/injection), 2 injections of Ad vectors (1x109 viral 
particles/injection), or 5 weekly injections followed by a boost with Ad. Two weeks after the last 

15 injection, humoral and cell mediated immune response were analyzed. Mice were also challenged with a 
subcutaneous (s.c.) injection of 5x105 MC38-CEA cells (Clarke et al., supra). At weekly intervals, mice 
were examined for tumor growth. 

EXAMPLE 7 

20 Antibodies detection and titration. 

Sera for antibody titration were obtained by retro-orbital bleeding. ELISA assay was 
performed using highly purified CEA protein as previously described (Facciabene et al., supra). Anti- 
CEA serum titers were calculated as the reciprocal limiting dilution of serum producing an absorbance at 
least 3-fold greater than the absorbance of autologous pre-immune serum at the same dilution. 
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IFN-yELISPOT assay. 

Assays were earned out using mouse splenocytes and CEA-specific peptides as 
previously described (Facciabene et al., supra). Briefly, purified splenocytes were plated in duplicate in 
30 96-wells MAIP plates (Millipore Corp., Billerica, MA) at a density of 5x105 and 2.5x105 cells/well. 

Cells were incubated for 20 h at 37°C with lug/ml suspension of each peptide. Concanavalin A was used 
as positive internal control for each mouse at 5ng/ml. Plates were washed and incubated for 12hrs at 4°C 
with 50ul/well of biotin-conjugated rat anti-mouse IFNy. After extensive washing, 50 jxl/well of NBT/B- 
CIP (Pierce Biotechnology, Inc., Rockford, IL) were added until development of spots was clearly 
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visible. Plates were then washed extensively and air dried. Spots were then counted using an automated 
ELISPOT reader. 

EXAMPLE 9 

5 Cytokine intracellular staining. 

One to two millions mouse PBMC or splenocytes in 1ml RPMI 10% FCS were incubated 
with the indicated pool of peptides (5-6 pig/ml final concentration of each peptide) and brefeldin A (1 
ug/ml; BD Pharmingen) at 37°C for 12-16 hours as previously described (Facciabene et al., supra). Cells 
were washed, stained with surface antibodies, fixed, permeabilized and incubated with TENy-FITC 

10 antibodies (BD Pharmingen). Cells were fixed with 1% formaldehyde solution in PBS and analyzed on a 
FACS-Calibur flow cytometer, using CellQuest software (Becton Dickinson, San Jose, CA). 

EXAMPLE 10 

CEA-DOM and CEA-FcIgG fusions enhance the immunogenicity of the CEA protein, 
j 5 To examine the immune responses induced by the plasmids encoding CEA-FRC, CEA- 

DOM, CEA-VSV-G, CEA-FcIgG, CEA-HSP70 and CEA-LAMP fusions, groups of 9 C57BL/6 mice 
were immunized with two i.m. injections of 50 or 5 jig of each plasmid. The immunizations were three 
weeks apart. In view of the enhanced transduction and immunogenicity reported with electroporation 
(Zucchelli et al. J. Virology 74:11598 (2000), Widera et al., J. Immunol. 164: 4635 (2000)), plasmid 
20 DNAs were routinely electroporated (DNA-EP) into mouse skeletal muscle. 

The immune response elicited by different plasmids was measured by TFNyELISPOT 
assay, 2 weeks after the last injection. Antigen-specific TFN Y secretion from stimulated splenocytes was 
measured using a pool of 15mer peptides overlapping by 1 1 aa and encompassing the C-terminal region 
of CEA (pool D, aa 497-703) (Zucchelli et al., supra). The analysis of the immune response to CEA was 
25 carried out with peptide pool D since the cellular immune response to CEA in C57BL/6 mice is primarily 
biased towards the C-terminal region of this protein (Zuccelli et al., supra). As a negative control, 
cytokine production was also measured upon stimulation of the splenocytes with DMSO at the same 
concentration utilized to solubilize the CEA peptides. 

Injection of pVU/CEA-DOM or pVU/CEA-Fc elicited'a greater immune response to 
30 CEA as compared to pVU/CEA. The greater immunogenicity of these two fusion proteins resulted in 

higher geometric mean values of spot forming cells (SFC) per 10© splenocytes (FIGURE 3A). Plasmids 
pVU/CEA-DOM and pVU/CEA-FcIgG had similar immunogenic properties and exerted a 3- to 4-fold 
increase in CEA-specific immune responses upon injection of 5 or 50 Jig of plasmid DNA (pVU/CEA- 
DOM: 590 and 1098 SFC/106 splenocytes, pVU/CEA-FcIgG: 510 and 1 160, pVU/CEA: 146 and 264 
35 SFC/106 splenocytes, respectively). No significant differences were noted between the SFC values 
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elicited by the pVU/CEA-FrC, pVU/CEA-LAMP, pVU/CEA-HSP70 and pVU/CEA. No CEA specific 
immune responses were detected in negative control samples. 

To determine the effect of the CEA-fusions on the humoral response to CEA, sera from 
immunized mice were tested in ELISA using purified CEA protein as substrate (FIGURE 3B). An 

5 increase in CEA-specific antibody titer was observed upon injection of 50 \ig of plasmids pVl J/CEA- 
DOM, pVU/CEA-FcIgG, pVU/CEA-FrC and P V1J/CEA-HSP70. On the contrary, injection of 
pVU/CEA-LAMP and pVU/CEA-VSV-G resulted in a CEA-specific antibody response similar to that 
observed upon immunization with pVU/CEA. Taken together, these data demonstrate that fusion of the 
CEA coding sequence to the DOM or FcIgG cDNA results in an increase in the CEA-specific cell 

10 mediated and humoral immune response. 

EXAMPLE 11 

CEA-DOM and CEA-FcIgG fusions break tolerance to target antigen in CEA transgenic mice. 

Tolerance to the target antigen is one of the hurdles that a cancer vaccine must overcome 
15 to elicit an immune response and to exert an efficient antitumor effect. Thus, it was deemed appropriate 
to determine whether the enhanced immunogenic properties of CEA-DOM and CEA-FcIgG fusions 
would break tolerance to CEA more efficiently than the CEA protein. To this end, CEA transgenic mice 
were utilized to perform comparative immunization studies. These transgenic mice carry the entire 
human CEA gene and flanking sequences and express the CEA protein in the intestine, mainly in the 
20 cecum and colon. This mouse line is a useful model for studying the safety and efficacy of 
immunotherapy strategies directed against this tumor self antigen (Clarke et al., supra). 

In view of the enhanced immunogenic properties of vectors carrying the codon usage 
optimized cDNA (cDNAopt) of CEA, both plasmid and Adenovirus vectors were engineered to carry the 
cDNAopt of the CEA-DOM (CEA-DOMopt) or CEA-FcIgG (CEA-FcIgGopt) fusions. As observed for 
25 CEA, CEA-DOMopt and CEA-FcIgGopt cDNAs were shown to be expressed with a greater efficiency of 
the corresponding wild type cDNA leading to an enhanced immune response to CEA (data not shown). 

The immunogenicity of these two fusion proteins was compared to that of CEA by a 
series of immunization studies based on the use of plasmid DNA and Ad vectors administered either 
alone or in combination. Cohorts of CEA transgenic mice were immunized with the following varying 
30 regimens: i) 5 injections at weekly intervals of 50 fxg of plasmid DNA (DNA/DNA), ii) 2 biweekly 
injections of Adenovirus in doses ranging from lxK>7 to 1x109 viral particles (vp) of Adenovirus 
(Ad/Ad), or iii) 5 weekly injections of plasmid DNA followed by a final injection of 1x10^ vp of 
Adenovirus (DNA/ Ad). Immune responses were analyzed by intracellular IFNystaining on PBMC or 
splenocytes of each immunized mouse using pool D peptides. Additionally, the induction of CEA- 
35 specific antibodies was monitored by ELISA. 
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DNA/DNA immunization of the CEA transgenic mice revealed that the CEA-DOMopt 
and CEA-FcIgGopt vectors exerted a measurable CD8 + T cell response to the target antigen (FIGURE 
4A). Thus, both constructs were able to break tolerance to CEA in these mice. The antigen specific 
response elicited by CEA-DOM and CEA-FcIgG fusion proteins was comparable as indicated by the 
average values of IFNy intracellular staining (0.22 and 0.34%, respectively). Nonetheless, the immune 
response elicited by these two constructs was greater than that observed upon vaccination with 
pVU/CEAopt (0.07%). Similarly, anti-CEA humoral response was also greater upon vaccination with 
the fusion proteins. CEA-specific antibody titer was detected in all mice immunized with pVU/CEA- 
DOMopt and pVU/CEA-FdgGopt and the average of the antibody titer was 56,136 and 24,725, 
respectively. By contrast, the pVU/CEAopt immunized group showed an at least a 77 fold lower 
average titer of CEA-specific antibody (318) (FIGURE 4B). 

CEA transgenic mice treated with the Ad/Ad vaccination regimen also showed a better 
efficiency in breaking tolerance to CEA upon vaccination with CEA-DOMopt and CEA-FcIgGopt Ad 
vectors than with Ad-CEAopt. A CEA-specific CD8 + T cell response could be observed in the 
vaccinated mice upon injection of a little as 107 vp of Ad-CEA-DOM or Ad-CEA-FcIgG, the CEA- 
specific response was comparable between the two antigens, and increased upon injection of 10^ vp 
(1.55% and 1.15%, respectively). By contrast, 10^ vp of Ad-CEAopt were necessary to elicit significant 
CD8 + T-cell precursor frequencies (2.1%) (FIGURE 4C). CEA-specific antibodies were detected in all 
mice immunized with Ad-CEA-DOM6pt and Ad-CEA-FcIgGopt. The averages of the antibody titer 
were 19,600 and 33,000, respectively. Injection of Ad-CEAopt resulted in a measurable CEA-specific 
response in only 2 of the treated mice, and the antibody titer was significantly lower (Zucchelli et al., 
supra) (FIGURE 4D). Interestingly, the DNA/Ad immunization showed reduced differences in the CD8 + 
T cell precursor frequencies elicited by CEA, CEA-DOM and CEA-FcIgG vectors (FIGURE 5A). 
However, averages of CEA-specific antibody titers were greater upon vaccination with vectors 
expressing CEA-DOM and CEA-FcIgG than CEA (31,200, 26,120 and 412, respectively) (FIGURE 5B). 

Interestingly, regardless of the antigen, no obvious CD4 + cell Thl response to CEA was 
detected in any of the three vaccination regimens (data not shown). However, significant CD4 + cell Thl 
response against to the helper epitope, p30, present within DOM sequences (Rice et al., J. Immunol. 167: 
1558-65 (2001)) were detected after DNA/DNA vaccination (0.4%) (FIGURE 6). 

Thus, these data demonstrate that the CEA-DOM and CEA-FcIgG fusion proteins can 
break tolerance to CEA in transgenic mice with greater efficacy than the CEA protein. The enhanced 
immunogenic properties of these fusion proteins can be observed upon immunization with DNA or Ad 
vectors. However, the greater ability of these two fusion proteins in eliciting CD8 + T cells to CEA can 
be overcome, at least in part, by DNA/Ad vaccination regimen. 
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T-cell depletion studies. 

Immunized animals were depleted of CD4+T cells, CD8 + T cells, NK cells, by i.p. 
injection of anti-CD4 (GK1.5 hybridoma), anti-CD8 (Lyt2.2 hybridoma), or anti Asialo GM1 (Wako 
Chemicals, Richmond, VA) as described {Pemcone et aL, 7. lmmunother. 27(4):273-8 1. (20.04);. Yoon et 
5 aL, J. EthnopharmacoL 93 (2-3):247-53 (2004)). Antibodies (100 jid diluted ascitic fluid/dose) were 
injected on day -7 relative to the tumor challenge and then injected every week for 3 weeks after 
injection of 5x10$ MC38-CEA cells. Depletion conditions were validated by flow cytometry analysis of 
peripheral blood using phycoerythrin-conjugated MAbs anti-CD4, anti-CD8, and anti-NK (PharMingen, 
San Diego, CA); 99% of the relevant cell subset was depleted, whereas all other subsets remained within 
10 normal levels. 

EXAMPLE 13 

CEA-DOM immunization exerts an antitumor effect in CEA transgenic mice. 

We next ascertained whether the increased immunogenicity of the CEA-DOM or CEA- 

15 FcIgG fusions would also lead to an enhanced therapeutic effect, capable of interfering with tumor 
progression. For this purpose, groups of 10 CEA transgenic mice were subjected to the DNA/DNA, 
Ad/Ad or DNA/Ad immunization regimens with the CEA-DOM, CEA-FcIgG, or CEA vectors. Two 
weeks after the last immunization, the CEA transgenic mice were challenged with a subcutaneous 
injection of 5x10^ MC38-CEA cells, a syngenic tumor cell line that expresses CEA (Clarke et aL, supra). 

20 Immunization with DNA/DNA or Ad/ Ad modalities did not result in any significant antitumor effect, 
regardless of the protein expressed by the injected vectors (FIGURE 7). In contrast, DNA-EP/Ad 
immunization with vectors encoding the CEA-DOM fusion protein resulted in a significant antitumor 
effect with 7 out of 10 treated mice remaining tumor free by day 34 post challenge. Thus, these results 
indicate that the enhanced CEA-specific immune response associated with the CEA-DOMopt cDNA, and 

25 the DNA/Ad vaccination regimen correlate with a significant antitumor effect in CEA transgenic mice. 

EXAMPLE 14 

The CEA-DOM antitumor effect is dependent on CD4 + T cells, CD8 + T cells and NK cells. 

The effector cells involved in the antitumor effect observed upon DNA-EP and Ad 

30 immunization with vectors encoding CEA-DOM fusion were characterized. After DNA/Ad 

immunization, but prior to tumor challenge, mice were depleted of CD4+, CD8+ T cells, or NK cells by 
MAbs. Antibodies were given during the course of tumor challenge to ensure continued depletion of the 
relevant NK and T cell subsets. The depletion of all three cell types was monitored by flow cytometry 
analysis using antibodies specific for cell surface markers (data not shown). Depletion of CD4 + , CD8 + T 

35 cells, or NK cells had a negative effect on survival of the immunized mice resulting in the drastic 
reduction of tumor-free mice as compared to the vaccinated group (FIGURE 8). Thus, these data 
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indicate that NK, CD4 + and CD8 + T cells play an important role in the antitumor effect exerted by CEA- 
DOM vaccination. 

EXAMPLE 15 

5 Statistical analysis. 

Where indicated, results were analyzed by the log rank or two tailed Student t test. A p 
value < 0.05 was considered significant. 
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WHAT IS CLAIMED IS: 

1. A nucleic acid molecule comprising a sequence of nucleotides that encodes a 
5 CEA fusion protein, wherein the CEA fusion protein comprises a CEA protein or variant thereof, fused 
to a substantial portion of an immunoenhancing element selected from the group consisting of: DOM, 
FcIgG, CT, and LT; and wherein the fusion protein is capable of producing an immune response in a 
mammal. 

10 2. The nucleic acid molecule of claim 1, wherein the CEA protein is a human CEA 

protein or variant thereof. 

3. The nucleic acid molecule of claim 1, wherein the CEA protein is a rhesus 
monkey CEA protein or variant thereof. 
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4. The nucleic acid molecule of claim 1, wherein the CEA protein is C-terminally 

truncated. 



5. The nucleic acid molecule of claim 4, wherein the C-terminal truncation 
20 comprises amino acids 679 - 702 of SEQ ID NO:3. 

6. The nucleic acid molecule of claim 2, wherein the immunoenhancing element 
comprises a substantial portion of DOM. 

25 7. The nucleic acid molecule of claim 2, wherein the immunoenhancing element 

comprises a substantial portion of FcIgG. 

8. The nucleic acid molecule of claim 1, wherein the sequence of nucleotides is 
selected from the group consisting of SEQ ID NOs:l, 5, 29, 35, and 37. 

30 

9. A nucleic acid molecule that hybridizes under high stringency conditions to the 
complement of the nucleic acid molecule of claim 8. 

10. A vector comprising the nucleic acid molecule of claim 1. 

35 
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vector. 



11. The vector of claim 10, wherein the vector is an adenovirus vector or a plasmid 



12 The vector of claim 1 1, wherein the vector is an Ad 5 vector. 
13. The vector of claim 1 1, wherein the vector is an Ad 6 vector. 
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14. The vector of claim 1 1, wherein the vector is pVl JnsB. 

15. A host cell comprising the vector of claim 1 1 . 



comprising: 



16. A process for expressing a CEA fusion protein in a recombinant host cell, 



(a) introducing a vector comprising the nucleic acid molecule of claim 1 
15 into a suitable host cell; and, 

(b) culturing the host cell under conditions which allow expression of said 

CEA fusion protein. 



20 



17. A purified CEA fusion protein encoded by the nucleic acid molecule of claim 1 . 

18. The purified CEA fusion protein of claim 17, wherein the fusion protein 
comprises a sequence of amino acids selected from the group consisting of: SEQ ID NOS:25, 26, 36 and 
38. 



25 19 - The use of the nucleic acid molecule of claim 1 in a mediciament for the 

treatment of cancer. 
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20. A method of preventing or treating cancer comprising administering to 
mammal the nucleic acid molecule of claim 1. 
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ABSTRACT OF THE DISCLOSURE 

Polynucleotides encoding carcinoembryonic antigen (CEA) fusion proteins are provided, 
the CEA fusion proteins comprising a CEA protein, or functional variant thereof, fused to a substantial 
portion of an immunoenhancing element as described herein. The polynucleotides of the present 
invention can elicit an immune response in a mammal, which, in preferred embodiments, is stronger than 
the immune response elicited by a wild-type CEA. The gene encoding CEA is commonly associated with 
the development of human carcinomas. The present invention provides compositions and methods to 
elicit or enhance immunity to the protein product expressed by the CEA tumor-associated antigen, 
wherein aberrant CEA expression is associated with a carcinoma or its development. This invention 
specifically provides adenoviral vector and plasmid constructs carrying polynucleotides encoding CEA 
fusion proteins and discloses their use in vaccines and pharmaceutical compositions for preventing and 
treating cancer. 
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MESP SAP PHR WCIP W Q R • 
1 ATGGAGAGCC CCAGCGCCCC CCCCCACCGC TGGTGCATCC CCTGGCAGCG 

-LliL TASL L T F WNP P T T A • 
51 CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG 

KLT I E S T F F N V A E GKE 
101 CCAAGCTGAC CATCGAGAGC ACCCCCTTCA ACGTGGC CG A GGGCAAGGAG 

VLLL V H N LPQ H L F G Y S W • 
151 GTGCTGCTGC TGGTGCACAA CCTGCCCCAG CACCTGTTCG GCTACAGCTG 

•YKG ERVD GNR QII G Y V I ■ 
201 GTACAAGGGC GAGCGCGTGG ACGGCAACCG CCAGATCATC GGCTACGTGA 

GTQ QAT PGPA YSG REI 
251 TCGGCACCCA GCAGGCCACC CCCGGCCCCG CCTACAGCGG CCGCGAGATC 

IYPN ASLi L I Q NIIQ NDT- 
301 ATCTACCCCA ACGCCAGCCT GCTGATCCAG AACATCATCC AGAACGACAC 

•GFY TLHV IKS DLV N E E A • 
351 CGGCTTCTAC ACCCTGCACG TGATCAAGAG CGACCTGGTG AACGAGGAGG 

TGQ FRV YPEL PKP SIS 
401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CAGCATCAGC 

SNNS KPV EDK D A V A FTC- 
451 AGCAACAACA GCAAGCCCGT GGAGGACAAG GACGCCGTGG CCTTCACCTG 

•EPE TQDA TYL WWV N N Q S • 
501 CGAGCCCGAG ACCCAGGACG CCACCTACCT GTGGTGGGTG AACAACCAGA 

L» P V SPR LQIiS NGN RTL 
551 GCCTGCCCGT GAGCCCCCGC CTGCAGCTGA GCAACGGCAA CCGCACCCTG 

TLFN VTR NDT ASYK C E T • 
601 ACCCTGTTCA ACGTGACCCG CAACGACACC GCCAGCTACA AGTGCGAGAC 

» Q N P VSAR RSD SVI L N V L» ♦ 
651 CCAGAACCCC GTGAGCGCCC GCCGCAGCGA CAGCGTGATC CTGAACGTGC 

YGP DAP TISP LNT SYR 
701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CAGCTACCGC 

SGEN LNL SCH AASN PPA- 
7 51 AGCGGC GAGA ACCTGAACCT GAGCTGCCAC GCCGCCAGCA ACCCCCCCGC 

■QYS WFVN GTF QQS TQEL- 
801 CCAGTACAGC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC 

FIP NIT VNNS GSY TCQ 
851 TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CACCTGCCAG 

AHNS DTG LNR TTVT TIT- 
901 GCCCACAACA GCGACACCGG CCTGAACCGC ACCACCGTGA CCACCATCAC 

■ V Y A EPPK PFI TSN NSNP- 
951 CGTGTACGCC GAGCCCCCCA AGCCCTTCAT CACCAGCAAC AACAGCAACC 

VED EDA VALT CEP EIQ 
10 01 CCGTGGAGGA CGAGGACGCC GTGGCCCTGA CCTGCGAGCC CGAGATCCAG 

NTTY LWW VNN QSLP VSP- 
1051 AACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGC CCGTGAGCCC 

•RLQ LSND NRT LTL LSVT- 
1101 CCGCCTGCAG CTGAGCAACG ACAACCGCAC CCTGACCCTG CTGAGCGTGA 

RND VGP YEGG I Q N ELS 
1151 CCCGCAACGA CGTGGGCCCC TACGAGTGCG GCATCCAGAA CGAGCTGAGC 

VDHS DPV ILN VliYG PDD- 
12 01 GTGGACCACA GCGACCCCGT GATCCTGAAC GTGCTGTACG GCCCCGACGA 

•PTI SPSY TYY RPG VNLS- 

12 51 CCCCACCATC AGCCCCAGCT ACACCTACTA CCGCCCCGGC GTGAACCTGA 

L S C HAA SNPP AQY SWL 

13 01 GCCTGAGCTG CCACGCCGCC AGCAACCCCC CCGCCCAGTA CAGCTGGCTG 

IDGN I Q Q HTQ ELFI SNI- 
13 51 ATCGACGGCA ACATCCAGCA GCACACCCAG GAGCTGTTCA TCAGCAACAT 

-TEK NSGL YTC QAN NSAS- 
1401 CACCGAGAAG AACAGCGGCC TGTACACCTG CCAGGCCAAC AACAGCGCCA 

GHS RTT VKTI TVS AEL 
1451 GCGGCCACAG CCGCACCACC GTGAAGACCA TCACCGTGAG CGCCGAGCTG 
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PKPS I S S NNS KPVE D K D • 
1501 CCCAAGCCCA GCATCAGCAG CAACAACAGC AAGCCCGTGG AGGACAAGGA 

•AVA FTCE PEA Q N T T Y I* W 
1551 CGCCGTGGCC TTCACCTGCG AGCCCGAGGC CCAGAACACC ACCTACCTGT 

WVN GQS L, P V S P R L, QLS 
1601 GGTGGGTGAA CGGCCAGAGC CTGCCCGTGA GCCCCCGCCT GCAGCTGAGC 

NGNR T L. T IiFN V T R N D A R • 
1651 AACGGCAACC GCACCCTGAC CCTGTTCAAC GTGACCCGCA ACGACGCCCG 

• A Y V CGIQ N S V SAN RSDP 
1701 CGCCTACGTG TGCGGCATCC AGAACAGCGT GAGCGCCAAC CGCAGCGACC 

V T L DVL YGPD T P I ISP 
1751 CCGTGACCCT GGACGTGCTG TACGGCCCCG ACACCCCCAT CATCAGCCCC 

PDSS YLS G A N L, N L# S CHS- 
.1801 CCCGACAGCA GCTACCTGAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG 

• A S N PSPQ Y S W R I N GIPQ 
1851 CGCCAGCAAC CCCAGCCCCC AGTACAGCTG GCGCATCAAC GGCATCCCCC 

Q H T Q V L FIAK ITP NNN 
1901 AGCAGCACAC CCAGGTGCTG TTCATCGCCA AGATCACCCC CAACAACAAC 

G T Y A CFV SNIi ATGR NNS- 
1951 GGCACCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG 

• I V K S I T V SAS GTS RSTP 
2001 CATCGTGAAG AGCATCACCG TGAGCGCCAG CGGCACC TCT AGAAGCACCC 

I P F SYS K N I* D CWV DNE 
2051 CCATCCCATT CAGCTACAGC AAGAACCTGG ACTGCTGGGT GGACAACGAG 

EDID VII, KKS T I L. N L D I • 
2101 GAGGACATCG ACGTGATCCT GAAGAAGAGC ACCATCCTGA ACCTGGACAT 

-NND IISD ISG FNS SVIT 
2151 CAACAACGAC ATCATCAGCG ACATCAGCGG CTTCAACAGC AGCGTGATCA 

YPD A Q I* VPGI NGK A I H 
2201 CCTACCCCQA CGCCCAGCTG GTGCCCGGCA TCAACGGCAA GGCCATCCAC 

L.VNN ESS EVI VHKA M D I • 
2251 CTGGTGAACA AC G AG AGC AG CGAGGTGATC GTGCACAAGG CCATGGACAT 

- 33 Y N DMFN N F T VSF WLRV 
2301 CGAGTACAAC GACATGTTCA ACAACTTCAC CGTGAGCTTC TGGCTGAGAG 

- PKV SAS HLEQ Y G T NEY 
23 51 TGCCTAAGGT GAGCGCCAGC CACCTGGAGC AGTACGGCAC CAACGAGTAC 

SIIS SMK KHS L. S I G SGW- 
2401 AGCATCATCA GCAGCATGAA GAAGCACAGC CTGAGCATCG GCAGCGGCTG 

-SVS L.KGN NL.I WTL KDSA 
2451 GAGCGTGAGC CTGAAGGGCA ACAACCTCAT CTGGACCCTG AAQG AT AGC G 

- GEV RQI T F R D LPD KFN 
2501 CCGGAGAGGT GAGACAGATC ACCTTCAGAG ACCTGCCCGA CAAGTTCAAT 

A Y L A NKW VFI T I T N D R 1» • 
2 551 GCCTACCTGG CCAACAAGTG GGTGTTCATC ACCATCACCA ACGACAGACT 

-SSA NL.YI NGV LMG SAEI 
2 601 OAGCAGCGCC AACCTGTACA TCAACGGCGT GCTCATGGGC AGCGCCGAGA 

T G L. G A I REDN NIT LKL. 
2 651 TCACCGGCCT GGGCGCCATC AGAGAGGACA ACAACATCAC CCTGAAGCTG 

DRCN NNN QYV SIDK F R I • 
2701 GACAGATGCA ACAACAACAA CCAGTACGTG AGCATC G AC A AGTT C C G GAT 

- F C K A Ii N P KEI E K L YTSY 
27 51 CTTCTGCAAG GCCCTGAACC CCAAGGAGAT CGAGAAGCTG TACAC CAGCT 

LSI T F I, RDFW GNP DRY 
2801 ACCTGAGCAT CACCTTCCTG AGAGACTTCT GGGGCAACCC CCTGAGATAC 

D T * (SEQ ID NO:l) 
2851 G AC ACC TAG {SEQ ID NO: 25) 
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MESP SAP PHR WCIP WQR LLL 

1 ATGGAGAGCC CCAGCGCCCC CCCCCACCGC TGGTGCATCC CCTGGCAGCG CCTGCTGCTG 

TASL L T F W N P P T T A K Li T IES 

61 ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG CCAAGCTGAC CATCGAGAGC 

T P F N VAE GKE V'LLI* VHN I* P Q 

121 ACCCCCTTCA ACGTGGCCGA GGGCAAGGAG GTGCTGCTGC TGGTGCACAA CCTGCCCCAG 

HLFG YSW YKG ERVD GNR Q I I 

181 CACCTGTTCG GCTACAGCTG GTACAAGGGC GAGCGCGTGG ACGGCAACCG CCAGATCATC 

GYVI GTQ QAT PGPA YSG R E I 

241 GGC TACGTG A TCGGCACCCA GCAGGCCACC CCCGGCCCCG CCTACAGCGG CCGCGAGATC 

IYPN ASL LIQ NIIQ NDT GFY 

301 ATCTACCCCA ACGCCAGCCT GCTGATCCAG AACATCATCC AGAACGACAC CGGCTTCTAC 

T L» H V IKS DLV NEEA T G Q FRV 

3 61 ACCCTGCACG TGATCAAGAG CGACCTGGTG AACGAGGAGG CCACCGGCCA GTTCCGCGTG 

Y P E L PKP SIS SNNS KPV EDK 
421 TACCCCGAGC TGCCCAAGCC CAGCATCAGC AGCAACAACA GCAAGCCCGT GGAGGACAAG 

D A V A FTC EPE T Q D A TYL WWV 

481 GACGCCGTGG CCTTCACCTG CGAGCCCGAG ACCCAGGACG CCACCTACCT GTGGTGGGTG 

NNQS LPV SPR It Q Li S NGN R T I* 

541 AACAACCAGA GCCTGCCCGT GAGCCCCCGC CTGCAGCTGA GCAACGGCAA CCGCACCCTG 

TLFN VTR NDT ASYK CET QNP 

6 01 ACC CTGTTC A ACGTGACCCG CAACGACACC GCCAGCTACA AGTGCGAGAC CCAGAACCCC 

VSAR RSD SVI LNVL YGP DAP 

661 GTGAGCGCCC GCCGCAGCGA CAGCGTGATC CTGAACGTGC TGTACGGCCC CGACGCCCCC 

TISP LNT SYR SGEN L» N L SCH 

721 ACCATCAGCC CCCTGAACAC CAGCTACCGC AGCGGCGAGA ACCTGAACCT GAGCTGCCAC 

AASN PPA QYS WFVN GTF QQS 

7 81 GCCGCCAGCA ACCCCCCCGC CCAGTACAGC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC 

T Q E L» FIP NIT VNNS GSY TCQ 

841 ACCCAGGAGC TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CACCTGCCAG 

AHNS DTG I» N R TTVT TIT V Y A 

901 GCCCACAACA GCGACACCGG CCTGAACCGC AC C AC CGTGA CCACCATCAC CGTGTACGCC 

EPPK PFI TSN NSNP VED EDA 

9 61 GAGCCCCCCA AGCCCTTCAT CACCAGCAAC AACAGCAACC CC GTGGAGG A CGAGGACGCC 

V A Ii T CEP EIQ NTTY LWW VNN 
1021 GTGGCCCTGA CCTGCGAGCC CGAGATCCAG AACACCACCT ACCTGTGGTG GGTGAACAAC 

QSLP VSP R Ii Q LSND NRT LTL 

1081 CAGAGCCTGC CCGTGAGCCC CCGCCTGCAG CTGAGCAACG ACAACCGCAC CCTGACCCTG 

LSVT RND VGP YECG IQN ELS 

1141 CTGAGCGTGA CCCGCAACGA CGTGGGCCCC TACGAGTGCG GCATCCAGAA CGAGCTGAGC 

VDHS DPV I It N VLYG PDD PTI 

12 01 GTGGACCACA GCGACCCCGT GATCCTGAAC GTGCTGTACG GCCCCGACGA CCCCACCATC 

SPSY TYY RPG VNLS LSC HAA 

12 61 AGCCCCAGCT ACACCTACTA CCGCCCCGGC GTGAACCTGA GCCTGAGCTG CCACGCCGCC 

SNPP AQY SWL IDGN I Q Q HTQ 

13 21 AGCAACCCCC CCGCCCAGTA CAGCTGGCTG ATCGACGGCA ACATCCAGCA GCACACCCAG 

ELFI SNI TEK NSGL YTC QAN 

1381 GAGCTGTTCA TCAGCAACAT CACCGAGAAG AACAGCGGCC TGTACACCTG CCAGGCCAAC 

NSAS GHS RTT VKTI TVS AEL 

1441 AACAGCGCCA GCGGCCACAG CCGCACCACC GTGAAGACCA TCACCGTGAG CGCCGAGCTG 

PKPS ISS NNS KPVE DKD AVA 

1501 CCCAAGCCCA GCATCAGCAG CAACAACAGC AAGCC CGTGG AGG AC AAGG A CGCCGTGGCC 

FTCE PEA QNT TYLW WVN GQS 

1561 TTCACCTGCG AGCCCGAGGC CCAGAACACC ACCTACCTGT GGTGGGTGAA CGGCCAGAGC 

LPVS PRL QIjS NGNR TLT LFN 
1621 CTGCCCGTGA GCCCCCGCCT GCAGCTGAGC AACGGCAACC GCACCCTGAC CCTGTTCAAC 

VTRN DAR AYV CGIQ NSV SAN 
1681 GTGACCCGCA ACGACGCCCG CGCCTACGTG TGCGGCATCC AGAACAGCGT GAGCGCCAAC 

RSDP VTL DVL YGPD TPI ISP 
1741 CGCAGCGACC CCGTGACCCT GGACGTGCTG TACGGCCCCG ACACCCCCAT CATCAGCCCC 
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PDSS YLS G A N L N L S CHS A S N 
1801 CCCGACAGCA GCTACCTGAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG CGCCAGCAAC 

PSPQ YSW R I N GIPQ QHT QVL 
1861 CCCAGCCCCC AGTAC AGCTG GCGCATCAAC GGCATCCCCC AGCAGCACAC CCAGGTGCTG 

FIAK I T P NNN G T Y A CFV SNL 
1921 TTCATCGCCA AGATCACCCC CAACAACAAC GGCACCTACG CCTGCTTCGT GAGCAACCTG 

ATGR NNS IVK SlTV SAS GTS 
1981 GCCACCGGCC GCAACAACAG CATCGTGAAG AGCATCACCG TGAGCGCCAG CGGCACCTCT 

RKTH TCP PCP APEL LGG PSV 
2041 AGAAAGACC C ACACCTGCCC CCCTTGCCCT GCCCCTGAGC TGCTGGGCGG ACCCAGCGTG 

FLFP PKP KDT L.MIS RTP EVT 
2101 TTCCT6TTCC CCCCCAAGCC TAAGGACACC CTCATGATCA GCAGAACCCC CGAGGTGACC 

CVVV DVS HED P EVK FNW YVD 
2161 TGCGTGGTGG TGGACGTGAG C C AC GAGG AT CCCGAGGTGA AGTTCAACTG GTACGTGGAC 

GVEV HNA KTK PREE QYN STY 
2221 GGCGTGGAGG TGCACAATGC CAAGACCAAG CCCAGAGAGG AGCAGTACAA CAGCACCTAC 

RVVS VLT VLH Q D W 1» NGK EYK 
22 81 AGAGTGGTGA GCGTGCTCAC CGTGCTGCAC CAGGATTGGC TGAACGGCAA GGAGTACAAG 

CKVS NKA LPA PIEK TIS KAK 
2341 TGCAAG GTG A GCAACAAGGC CCTGCCTGCC CCCATCGAGA AAACCATCAG CAAGGCCAAG 

GQPR EPQ VYT LPPS RDE L T K 
2401 GGCCAGCCCA GAGAGCCCCA GGTGTACACC CTGCCCCCTA GCAGAGATGA GTTGACCAAG 

NQVS LTC It V K GFYP SDI AVE 
2461 AACCAGGTGA GCCTCACATG CCTGGTGAAG GGCTTCTACC CCAGCGACAT CGCCGTGGAG 

WESN GQP ENN YKTT PPV LDS 
2521 TGGGAGAGCA ACGGCCAGCC CGAGAACAAC TACAAGACCA CCCCCCCTGT GCTGGACAGC 

DGSF FLY SKL T V D K SRW QQG 
2581 GATGGCAGCT TCTTCCTGTA CAGCAAGCTC ACCGTGGACA AGAGCAGATG GCAGCAGGGC 

NVFS CSV MHE ALHN HYT QKS 
2 641 AACGTGTTCA GCTGCAGCGT GATGCACGAG GCCCTGCACA ATCACTACAC CCAGAAGAGC 

LSIiS PGK * (SEQ ID NO:26) 
2701 CTGAGCCTGA GCCCCGGCAA GTAA (SEQ ID NO: 5) 

FIGURE 10 
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atggagtctc 
acagcctcac 
acgccgttca 
catctttttg 
ggatatgtaa 
atatacccca 
accctacacg 
tacccggagc 
gatgctgtgg 
aacaatcaga 
actctattca 
gtgagtgcca 
accatttccc 
gcagcctcta 
acccaagagc 
gcccataact 
gagccaccca 
gtagccttaa 
cagagcctcc 
ctcagtgtca 
gttgaccaca 
tccccctcat 
tctaacccac 
gagctcttta 
aactcagcca 
cccaagccct 
ttcacctgtg 
ctcccagtca 
gtcacaagaa 
cgcagtgacc 
ccagactcgt 
ccatccccgc 
tttatcgcca 
gctactggcc 



cctcggcccc 
ttctaacctt 
atgtcgcaga 
gctacagctg 
taggaactca 
atgcatccct 
tcataaagtc 
tgcccaagcc 
ccttcacctg 
gcctcccggt 
atgtcacaag 
ggcgcagtga 
ctctaaacac 
acccacctgc 
tctttatccc 
cagacactgg 
aacccttcat 
cctgfcgaacc 
cggtcagtcc 
caaggaatga 
gcgacccagt 
acacctatta 
ctgcacagta 
tctccaacat 
gtggccacag 
ccatctccag 
aacctgaggc 
gtcccaggct 
atgacgcaag 
cagtcaccct 
cttacctttc 
agtattcttg 
aaatcacgcc 
gcaataattc 



tccccacaga 
ctggaacccg 
ggggaaggag 
gtacaaaggt 
acaagctacc 
gctgatccag 
agatcttgtg 
ctccatctcc 
tgaacctgag 
cagtcccagg 
aaatgacaca 
ttcagtcatc 
atcttacaga 
acagtactct 
caacatcact 
cctcaatagg 
caccagcaac 
tgagattcag 
caggctgcag 
tgtaggaccc 
catcctgaat 
ccgtccaggg 
tfcctfcggctg 
cactgagaag 
caggactaca 
caacaactcc 
tcagaacaca 
gcagctgtcc 
agcctatgta 
ggatgtcctc 
gggagcgaac 
gcgtatcaat 
aaataataac 
catagtcaag 
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tggtgcatcc 
cccaccac tg 
gtgcttctac 
gaaagagtgg 
ccagggcccg 
aacatcatcc 
aatgaagaag 
agcaacaact 
actcaggacg 
ctgcagctgt 
gcaagctaca 
ctgaatgtcc 
tcaggggaaa 
tggtttgtca 
gtgaataata 
accacagtca 
aactccaacc 
aacacaacct 
ctgtccaatg 
tatgagtgtg 
gtcctctatg 
gtgaacctca 
afctgatggga 
aacagcggac 
gtcaagacaa 
aaacccgtgg 
acctacctgt 
aatggcaaca 
tgtggaatcc 
tatgggccgg 
ctcaacctct 
gggataccgc 
gggacctatg 
agcatcacag 



cctggcagag 
ccaagctcac 
ttgtccacaa 
atggcaaccg 
catacagtgg 
agaatgacac 
caactggcca 
ccaaacccgt 
caacctacct 
ccaatggcaa 
aatgtgaaac 
tctatggccc 
atctgaacct 
atgggacttt 
gtggatccta 
cgacgatcac 
ccgtggagga 
acctgtggtg 
acaacaggac 
gaatccagaa 
gcccagacga 
gcctctcctg 
acatccagca 
tctatacctg 
tcacagtctc 
aggacaagga 
ggtgggtaaa 
ggaccctcac 
agaactcagt 
acacccccat 
cctgccactc 
agcaacacac 
cctgtfcfctgt 
tctctgcatc 



gctcctgctc 60 
tattgaatcc 120 
tctgccccag 180 
tcaaat-tata 240 
tcgagagata 3 00 
aggattctac 3 60 
gttccgggta 420 
ggaggacaag 480 
gtggtgggta 540 
caggaccctc 600 
cc agaac cca 6 60 
ggatgccccc 7 20 
ctcctgccac 780 
ccagcaatcc 840 
tacgtgccaa 900 
agtctatgca 9 60 
tgaggatgct 1020 
ggtaaataat 1080 
cctcactcta 1140 
cgaattaagt 1200 
ccccaccatt 1260 
ccatgcagcc 1320 
acacacacaa 1380 
ccaggccaat 1440 
tgcggagctg 1500 
tgcfcgtggcc 1560 
tggtcagagc 1620 
tctattcaat 1680 
gagtgcaaac 1740 
catttccccc 1800 
ggcctctaac 1860 
acaagttctc 1920 
ctctaacttg 1980 
tggaact (SEQ ID NO:2) 



1 MESPSAPPHR WCIPWQRLLL TASL/LTFWNP PTTAKLTIES TPFNVAEGKE 

51 VLLLVHISTLPQ HLFGYSWYKG ERVDGNRQII GYVIGTQQAT PGPAYSGREI 

101 IYPNASLLIQ NIIQNDTGFY TLHVIKSDLV NEEATGQFRV YPELPKPSIS 

151 SNNSKPVEDK DAVAFTCEPE TQDATYLWWV NNQSLPVSPR LQIiSNGNRTL 

201 TLFNVTKNDT ASYKCETQNP VSARRSDSVI LNVLYGPDAP TISPLNTSYR 

251 SGENLNLSCH AASNPPAQYS WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 

301 AHNSDTGLNR TTVTTITVYA EPPKPFITSN NSNPVEDEDA VALTCEPEIQ 

351 NTTYLWWVNN QSLPVSPRLQ LSNDNRTLTL LSVTRNDVGP YECGIQNELS 

401 VDHSDPVILN VLYGPDDPTI SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL 

451 IDGNIQQHTQ ELFXSNITEK NSGLYTCQAN NSASGHSRTT VKTITVSAEL 

501 PKPSISSNNS KPVEDKDAVA FTCEPEAQNT TYLWWVNGQS LPVSPRLQLS 

551 NGNRTLTLFN VTRNDARAYV CGIQNSVSAN RSDPVTLDVL, YGPDTPIISP 

601 PDSSYLSGAN LNLSCHSASN PSPQYSWRIN GIPQQHTQVL FIAKITPNNN 

651 GT Y AC F VSNLt ATGRNNSIVK SITVSASGT (SEQ ID NO : 3 ) 
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DSTP IPF SYS KNLD CWV DNE 

1 GATTCAACAC CAATTCCATT TTCTTATTCT AAAAATCTGG ATTGTTGGGT TGATAATGAA 

EDID V I It KKS T I la N LDI NND 

61 GAAGATATAG ATGTTATATT AAAAAAGAGT ACAATTTTAA ATTTAGATAT TAATAATGAT 

IISD I S G F N S S V I T YPD AQL 

121 ATTATATCAG ATATATCTGG GTTTAATTCA TCTGTAATAA CATATCCAGA TGCTCAATTG 

VPGI NGK A I H LVNN ESS EVI 

181 GTGCCCGGAA TAAATGGCAA AGCAATACAT TTAGTAAACA ATGAATCTTC TGAAGTTATA 

VHKA MDI EYN DMFN NFT VSF 

241 GTGCATAAAG CTATGGATAT TGAATATAAT GATATGTTTA ATAATTTTAC CGTTAGCTTT 

WDRV PKV SAS H L» E Q YGT NEY 

3 01 TGGTTGAGGG TTCCTAAAGT ATCTGCTAGT CATTTAGAAC AATATGGCAC AAATGAGTAT 

SIIS SMK KHS LSIG SGW SVS 

361 TCAATAATTA GCTCTATGAA AAAACATAGT CTATCAATAG GATCTGGTTG GAGTGTATCA 

LKGN NLI WTL KDSA GEV RQI 

421 CTTAAAGGTA ATAAC TTAAT ATGGACTTTA AAAGATTCCG CGGGAGAAGT TAGACAAATA 

TFRD LPD KFN AYLA NKW VFI 

481 ACTTTTAGGG ATTTACCTGA TAAATTTAAT GCTTATTTAG CAAATAAATG GGTTTTTATA 

T I T N DRL SSA NLYI NGV L.MG 

541 ACT AT TACT A ATGATAGATT ATCTTCTGCT AATTTGTATA TAAATGGAGT ACTTATGGGA 

SAEI TGL G A I RE DN NIT It K L 

601 AGTGCAGAAA TTACTGGTTT AGGAGC TATT AGAGAGGATA ATAATATAAC ATTAAAACTA 

DRCN NNN QYV S I D K FRI FCK 

661 GATAGATGTA ATAATAATAA TCAATACGTT TCTATTGATA AATTTAGGAT ATTTTGCAAA 

ALNP KEI EKL YTSY LSI T F la 

721 GCATTAAATC CAAAAGAGAT TGAAAAATTA T AC AC AAGTT ATTTATCTAT AACCTTTTTA 

RDFW GNP LRY DTDR *(SEQ ID NO:28) 

7 81 AGAGACTTCT GGGGAAACCC TTTACGATAT GATACAGATA GGTAG ( SEQ ID NO: 27) 

FIGURE 12 



USPTO from the IFW Image Database on 05/17/2005 



15/24 



1 ATGGAGTCTC CCTCGGCCCC TCCCCACAGA TGGTGCATCC CCTGGCAGAG GCTCCTGCTC 

61 ACAGCCTCAC TTCTAACCTT CTGGAACCCG CCCACCACTG CCAAGCTCAC TATTGAATCC 

121 ACGCCGTTCA ATGTCGCAGA GGGGAAGGAG GTGCTTCTAC TTGTCCACAA TCTGCCCCAG 

181 CATCTTTTTG GCTACAGCTG GTACAAAGGT GAAAGAGTGG ATGGCAACCG TCAAATTATA 

241 GGATATGTAA TAGGAACTCA ACAAGCTACC CCAGGGCCCG CATACAGTGG TCGAGAGATA 

3 01 ATATACCCCA ATGCATCCCT GCTGATCCAG AACATCATCC AGAATGACAC AGGATTCTAC 

3 61 ACCCTACACG TCATAAAGTC AGATCTTGTG AATGAAGAAG CAACTGGCCA GTTCCGGGTA 

421 TACCCGGAGC TGCCCAAGCC CTCCATCTCC AGCAACAACT CCAAACCCGT GGAGGACAAG 

481 GATGCTGTGG CCTTCACCTG TGAACCTGAG ACTCAGGACG CAACCTACCT GTGGTGGGTA 

541 AACAATCAGA GCCTCCCGGT CAGTCCCAGG CTGCAGCTGT CCAATGGCAA CAGGACCCTC 

601 ACTCTATTCA ATGTCACAAG AAATGACACA GCAAGCTACA AATGTGAAAC CCAGAACCCA 

661 GTGAGTGCCA GGCGCAGTGA TTCAGTCATC CTGAATGTCC TCTATGGCCC GGATGCCCCC 

721 ACCATTTCCC CTCTAAACAC ATCTTACAGA TCAGGGGAAA ATCTGAACCT CTCCTGCCAC 

7 81 GCAGCCTCTA ACCCACCTGC ACAGTACTCT TGGTTTGTCA ATGGGACTTT CCAGCAATCC 

841 ACCCAAGAGC TCTTTATCCC CAACATCACT GTGAATAATA GTGGATCCTA TACGTGCCAA 

901 GCCCATAACT CAGACACTGG CCTCAATAGG ACCACAGTCA CGACGATCAC AGTCTATGCA 

961 GAGCCACCCA AACCCTTCAT CACCAGCAAC AACTCCAACC CCGTGGAGGA TGAGGATGCT 

1021 GTAGCCTTAA CCTGTGAACC TGAGATTCAG AACACAACCT AC CTGTGGTG GGTAAATAAT 

1081 CAGAGCCTCC CGGTCAGTCC CAGGCTGCAG CTGTCCAATG ACAACAGGAC CCTCACTCTA 

1141 CTCAGTGTCA CAAGGAATGA TGTAGGACCC TATGAGTGTG GAATCCAGAA CGAATTAAGT 

12 01 GTTGACCACA GCGACCCAGT CATCCTGAAT GTCCTCTATG GCCCAGACGA CCCCACCATT 

1261 TCCCCCTCAT ACACCTATTA CCGTCCAGGG GTGAACCTCA GCCTCTCCTG CCATGCAGCC 

1321 TCTAACCCAC CTGCACAGTA TTCTTGGCTG ATTGATGGGA ACATCCAGCA ACACACACAA 

1381 GAGCTCTTTA TCTCCAACAT CACTGAGAAG AACAGCGGAC TCTATACCTG CCAGGCCAAT 

1441 AACTCAGCCA GTGGC C AC AG CAGGACTACA GTCAAGACAA TCACAGTCTC TGCGGAGCTG 

1501 CCCAAGCCCT CCATCTCCAG CAACAACTCC AAACCCGTGG AGGACAAGGA TGCTGTGGCC 

1561 TTCACCTGTG AACCTGAGGC TCAGAACACA ACCTACCTGT GGTGGGTAAA TGGTC AGAGC 

1621 CTCCCAGTCA GTCCCAGGCT GCAGCTGTCC AATGGCAACA GGACCCTCAC TCTATTCAAT 

1681 GTCACAAGAA ATGACGCAAG AGCCTATGTA TGTGGAATCC AGAACTCAGT GAGTGC AAAC 

1741 CGCAGTGACC CAGTCACCCT GGATGTCCTC TATGGGCCGG ACACCCCCAT CATTTCCCCC 

18 01 CCAGACTCGT CTTACCTTTC GGGAGCGAAC CTCAACCTCT CCTGCCACTC GGCCTCTAAC 

1861 CCATCCCCGC AGTATTCTTG GCGTATCAAT GGGATACCGC AGCAACACAC ACAAGTTCTC 

1921 TTTATCGCCA AAATCACGCC AAATAATAAC GGGACCTATG CCTGTTTTGT CTCTAACTTG 

1981 GCTACTGGCC GCAATAATTC CATAGTCAAG AGCATCACAG TCTCTGCATC TGGAACTCTA 

2041 GATTCAACAC CAATTCCATT TTCTTATTCT AAAAATCTGG ATTGTTGGGT TGATAATGAA 

2101 GAAGATATAG ATGTTATATT AAAAAAGAGT ACAATTTTAA ATTTAGATAT TAATAATGAT 

2161 ATTATATCAG ATATATCTGG GTTTAATTCA TCTGTAATAA CATATCCAGA TGCTCAATTG 

2221 GTGCCCGGAA TAAATGGCAA AGCAATACAT TTAGTAAACA ATGAATCTTC TGAAGTTATA 

2281 GTGCATAAAG CTATGGATAT TGAATATAAT GATATGTTTA ATAATTTTAC CGTTAGCTTT 

2341 TGGTTGAGGG TTCCTAAAGT ATCTGCTAGT CATTTAGAAC AATATGGCAC AAATGAGTAT 

2401 TCAATAATTA GCTCTATGAA AAAACATAGT CTATCAATAG GATCTGGTTG GAGTGTATCA 

2461 CTTAAAGGTA ATAACTTAAT ATGGACTTTA AAAGATTCCG CGGGAGAAGT TAGACAAATA 

2521 ACTTTTAGGG ATXTACCTGA TAAATTTAAT GCTTATTTAG CAAATAAATG GGTTTTTATA 

2 581 ACTATTACTA ATGATAGATT ATCTTCTGCT AATTTGTATA TAAATGGAGT ACTTATGGGA 

2 641 AGTGCAGAAA TTACTGGTTT AGGAGCTATT AGAGAGGATA ATAATATAAC AXTAAAACTA 

2701 GATAGATGTA ATAATAATAA TCAATACGTT TCTATTGATA AATTTAGGAT ATTTTGCAAA 

2 761 GCATTAAATC CAAAAGAGAT TGAAAAATTA TACACAAGTT ATTTATCTAT AACCTTTTTA 

2 821 AGAGACTTCT GGGGAAACCC TTTACGATAT GATATAG (SEQ ID NO : 2 9 ) 



FIGURE 13 



16/24 

1 ATGGGGTCTC CCTCAGCCCC TCTTCACAGA TGGTGCATCC CCTGGCAGAC 

51 GCTCCTGCTC ACAGCCTCAC TTCTAAC CTT CTGGAACCCG CCCACCACTG 

101 CCCAGCTCAC TATTGAATCC AGGCCGTTCA ATGTTGCAGA GGGGAAGGAG 

151 GTTCTTCTAC TTGCCCACAA TGTGTCCCAG AATCTTTTTG GCTACATTTG 

2 01 GTACAAGGGA GAAAGAGTGG ATGCCAGCCG TCGAATTGGA TCATGTGTAA 
251 TAAGAACTCA ACAAATTACC CCAGGGCCCG CACACAGCGG TCGAGAGACA 
301 ATAGACTTCA ATGCATCCCT GCTGATCCAC AATGTCACCC AGAGTGACAC 

3 51 AGGATCCTAC ACCATACAAG TCATAAAGGA AGATCTTGTG AATGAAGAAG 
401 CAACTGGCCA GTTCCGGGTA TACCCGGAGC TGCCCAAGCC CTACATCTCC 
451 AGCAACAACT CCAACCCCGT GGAGGACAAG GATGCTGTGG CCTTAACCTG 
501 TGAACCTGAG ACTCAGGACA CAACCTACCT GTGGTGGGTA AACAATCAGA 
5 51 GCCTCCCGGT CAGTCCCAGG CTGGAGCTGT CCAGTGACAA CAGGACCCTC 
601 ACTGTATTCA ATATTCCAAG AAATGACACA ACATCCTACA AATGTGAAAC 
651 CCAGAACCCA GTGAGTGTCA GACGCAGCGA CCCAGTCACC CTGAACGTCC 
7 01 TCTATGGCCC GGATGCGCCC ACCATTTCCC CTCTAAACAC ACCTTACAGA 
751 G C AGGGG AAA ATCTGAACCT CACCTGCCAC GCAGCCTCTA ACCCAACTGC 
801 ACAGTACTTT TGGTTTGTCA ATGGGACGTT CCAGCAATCC AC AC AAGAG C 
851 TCTTTATACC CAACATCACC GTGAATAATA GCGGATCCTA TATGTGCCAA 
901 GCCCATAACT CAGCCACTGG CCTCAATAGG ACCACAGTCA CGGCGATCAC 
951 AGTCTACGCG GAGCTGCCCA AGCCCTACAT CACCAGCAAC AACTCCAACC 

1001 CCATAGAGGA CAAGGATGCT GTGACCTTAA CCTGTGAACC TGAGACTCAG 

1051 GACACAACCT ACCTGTGGTG GGTAAACAAT CAGAGCCTCT CGGTCAGTTC 

1101 * CAGGCTGGAG CTGTCCAATG ACAACAGGAC CCTCACTGTA TTCAATATTC 

1151 CAAGAAACGA CACAACGTTC TACGAATGTG AGACCCAGAA CCCAGTGAGT 

1201 GTCAGACGCA GCGACCCAGT CACCCTGAAT GTCCTCTATG GCCCGGATGC 

1251 GCCCACCATT TCCCCTCTAA ACACACCTTA CAGAGCAGGG GAAAATCTGA 

13 01 ACCTCTCCTG CCACGCAGCC TCTAACC C AG CTGCACAGTA CTCTTGGTTT 

13 51 GTCAATGGGA CGTTCCAGCA ATCCACACAA GAGCTCTTTA TACCCAACAT 

1401 CACCGTGAAT AATAGCGGAT CCTATATGTG CCAAGCCCAT AACTCAGCCA 

1451 CTGGCCTCAA TAGGACCACA GTCACGGCGA TCACAGTCTA TGTGGAGCTG 

1501 CCCAAGCCCT ACATCTCCAG CAACAACTCC AACCCCATAG AGGACAAGGA 

1551 TGCTGTGACC TTAACCTGTG AACCTGTGGC TGAGAACACA ACCTACCTGT 

1601 GGTGGGTAAA CAATCAGAGC CTCTCGGTCA GTCCCAGGCT GCAGCTCTCC 

1651 AATGGCAACA GGATCCTCAC TCTACTCAGT GTCACACGGA ATGACACAGG 

1701 AC CCT ATGAA TGTGGAATCC AGAACTCAGA GAGTGCAAAA CGCAGTGACC 

1751 CAGTCACCCT GAATGTCACC TATGGCCCAG ACACCCCCAT CATATCCCCC 

1801 C C AG ACTTGT CTTACC GTTC GGGAGCAAAC CTCAACCTCT CCTGCCACTC 

1851 GGACTCTAAC CCATCCCCGC AGTATTCTTG GCTTATCAAT GGGACACTGC 

19 01 GGCAACACAC ACAAGTTCTC TTTATCTCCA AAATCACATC AAACAATAGC 

1951 GGGGCCTATG CCTGTTTTGT CTCTAACTTG GCT AC CGGTC GCAATAACTC 

2 001 CATAGTCAAG AACATCTCAG TCTCCTCTGG CGATTCAGCA CCTGGAAGTT 

2051 CTGGTCTCTC AGCTAGGGCT ACTGTCGGCA TCATAATTGG AATGCTGGTT 

2101 GGGGTTGCTC TGATGTAG (SEQ ID NO: 31) 
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1 ATGGGGTCTC CCTCAGCCCC TCTTCACAGA TGGTGCATCC CCTGGCAGAC 

51 GCTCCTGCTC ACAGCCTCAC TTCT AAC CTT CTGGAACCCG CCCACCACTG 

101 CCCAGCTCAC TATTGAATCC AGGCCGTTCA ATGTTGC AG A GGGGAAGGAG 

151 GTTCTTCTAC TTGCCCACAA TGTGTCCCAG AATCTTTTTG GCTACATTTG 

201 GTACAAGGGA GAAAGAGTGG ATGCCAGCCG TCGAATTGGA TCATGTGTAA 
251 ■ TAAGAACTCA ACAAATTACC CCAGGGCCCG CACACAGCGG TCGAGAGACA 

3 01 ATAGACTTCA ATGCATCCCT GCTGATC C AC AATGTCACCC AGAGTGACAC 
351 AGGATCCTAC AC C AT AC AAG TCATAAAGGA AGATCTTGTG AATGAAGAAG 
401 C AACTGGC C A GTTCCGGGTA TACCCGGAGC TGCCCAAGCC CTACATCTCC 

4 51 AGCAACAACT CCAACCCCGT GGAGG AC AAG GATGCTGTGG CCTTAACCTG 
501 TGAAC CTG AG ACTCAGGACA CAACCTACCT GTGGTGGGTA AACAATCAGA 

5 51 GCCTCCCGGT CAGTCCCAGG CTGGAGCTGT CCAGTGACAA CAGGACCCTC 

6 01 ACTGTATTCA ATATTCCAAG AAATGACACA ACATCCTACA AATGTGAAAC 
651 CCAGAACCCA GTGAGTGTCA GACGCAGCGA CCCAGTCACC CTGAACGTCC 

7 01 TCTATGGCCC GGATGCGCCC ACCATTTCCC C TCTAAAC AC ACCTTACAGA 
7 51 GCAGGGGAAA ATCTGAACCT CACCTGCCAC GCAGCCTCTA ACCCAACTGC 
801 ACAGTACTTT TGGTTTGTCA ATGGGACGTT CCAGCAATCC ACACAAGAGC 
851 TCTTTATACC CAACATCACC GTGAATAATA GCGGATCCTA TATGTGCCAA 
9 01 GCCCATAACT CAGCCACTGG CCTCAATAGG AC C AC AGTC A CGGCGATCAC 
951 AGTCTACGCG GAGCTGCCCA AGCCCTACAT CACCAGCAAC AACTCCAACC 

1001 CCATAGAGGA CAAGGATGCT GTGACCTTAA CCTGTGAACC TGAGACTCAG 

1051 GACACAACCT AC CTGTGGTG GGTAAACAAT CAGAGCCTCT CGGTCAGTTC 

1101 CAGGCTGGAG CTGTCCAATG ACAACAGGAC CCTCACTGTA TTCAATATTC 

1151 CAAGAAACGA CACAACGTTC TACGAATGTG AGACCCAGAA CCCAGTGAGT 

1201 GTCAGACGCA GCGACCCAGT CACCCTGAAT GTCCTCTATG GCCCGGATGC 

1251 GCCCACCATT TCCCCTCTAA AC AC AC CTT A CAGAGCAGGG GAAAATCTGA 

13 01 ACCTCTCCTG CCACGCAGCC TCTAACCCAG CTGCACAGTA CTTTTGGTTT 

13 51 GTCAATGGGA CGTTCCAGCA ATCCACACAA GAGCTCTTTA TAC CC AAC AT 

1401 CACCGTGAAT AATAGCGGAT CCTATATGTG CCAAGCCCAT AACTCAGCCA 
1451 CTGGCCTCAA TAGGACCACA GTCACGGCGA TCACAGTCTA TGTGGAGCTG 

1501 CCCAAGCCCT ACATCTCCAG CAACAACTCC AACCCCATAG AGGACAAGGA 

1551 TGCTGTGACC TTAACCTGTG AACCTGTGGC TGAGAACACA ACCTACCTGT 

1601 GGTGGGTAAA CAATCAGAGC CTCTCGGTCA GTCCCAGGCT GCAGCTCTCC 

1651 AATGGCAACA GGATCCTCAC TCTACTCAGT GTCACACGGA ATGACACAGG 

17 01 AC CCTATGAA TGTGGAATCC AGAACTCAGA GAGTGCAAAA CGCAGTGACC 

17 51 CAGTCACCCT GAATGTCACC TATGGCCCAG ACACCCCCAT CATATCCCCC 

1801 CCAGACTTGT CTTACCGTTC GGGAGCAAAC CTCAACCTCT CCTGCCACTC 

1851 GGACTCTAAC CCATCCCCGC AGTATTCTTG GCTTATCAAT GGGACACTGC 

1901 GGCAACACAC ACAAGTTCTC TTTATCTCCA AAATCACATC AAACAATAAC 

19 51 GGGGCCTATG CCTGTTTTGT CTCTAACTTG GCTACCGGTC GCAATAACTC 

2001 C ATAGTC AAG AACATCTCAG TCTCCTCTGG CGATTCAGCA CCTGGAAGTT 

2051 CTGGTCTCTC AGCTAGGGCT ACTGTCGGCA TCATAATTGG AATGCTGGTT 

2101 GGGGTTGCTC TGATGTAG (SEQ ID NO: 32) 

FIGURE 14B 
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1 MGSPSAPLHR WCIPWQTLLL TASLLTFWNP PTTAQLTIES RPFNVAEGKE 

51 VLLLAHNVSQ NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET 
101 •' IDFNASLLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS 

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTL 

201 TVFNIPRNDT TSYKCETQNP VSVRRSDPVT LNVLYGPDAP TISPLNTPYR 

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ 

301 AHNSATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ 

3 51 DTTYLWWVNN QSLSVSSRLE L SNDNRTLTV FNIPRNDTTF YECETQNPVS 

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQYSWF 

451 VNGTFQQSTQ ELFIPNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEXi 

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS 

551 NGNRILTLLS VTRNDTGPYE CGIQNSESAK RS D PVTLNVT YGPDTPIISP 

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWLIN GTLRQHTQVL FISKITSNNS 

651 G AYAC FVSNL» ATGRNNSIVK NISVSSGDSA PGSSGLSARA TVGIIIGMLV 

701 GVALM ( SEQ ID NO:33.) 

FIGURE 14C 

1 MGSPSAPLHR WCIPWQTLLL TASLLTFWNP PTTAQLTIES RPFNVAEGKE 

51 VLLLAHNVSQ NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET 

101 IDFNASLLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS 

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTL 

201 TVFNIPRNDT TSYKCETQNP VSVRRSDPVT LNVLYGPDAP TISPLNTPYR 

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ 

301 AHNS ATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ 

351 DTTYLWWVNN QSLSVSSRLE LSNDNRTLTV FNIPRNDTTF YECETQNPVS 

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQYFWF 

451 VNGTFQQSTQ ELF I PNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEL 

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS 

551 NGNRILTLLS VTRNDTGPYE CGIQNSESAK RS D PVTLNVT YGPDTPIISP 

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWLIN GTLRQHTQVL FISKITSNNN 

651 GAYACFVSNL ATGRNNSIVK NISVSSGDSA PGSSGLSARA TVGIIIGMLV 

7 01 GVALM (SEQIDNO:34) 



FIGURE 14D 
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1 MESPSAPPHR WCIPWQRLLL 

51 VLLLVHNLPQ HLFGYSWYKG 

101 lYPNASLLIQ NIIQNDTGFY 

151 SNNSKPVEDK DAVAFTCEPE 

201 TLFNVTRNDT ASYKCETQNP 

251 SGENLNLSCH AASNPPAQYS 

301 AHNSDTGLNR TTVTTITVYA 

351 NTTYIjWVJVNN QSLPVSPRLQ 

401 VDHSDPVILN VLYGPDDPTI 

451 IDGNIQQHTQ ELFISNITEK 

501 PKPSISSNNS KPVEDKDAVA 

551 NGNRTLTLFN VTRNDARAYV 

601 PDSSYLSGAM LNLSCHSASN 

651 GTYACFVSNL ATGRISJNSIVK 

7 01 LI (SEQ ID NO:30) 



TASLLTFWNP PTTAKLTIES TPFNVAEGKE 
ERVDGNRQII GYVIGTQQAT PGPAYSGREI 
TLHVIKSDLV NEEATGQFRV YPELPKPSIS 
TQDATYLWWV NNQSLPVSFR LQLSNGNRTL* 
VSARRSDSVI LNVLYG PDAP TISPLNTSYR 
WFVNGTFQQS TQELFIPNIT VNNSGSYTCQ 
EPPKPFITSN NSNPVEDEDA VALTCEPEIQ 
LSNDNRTLTL LSVTRNDVGP YECGIQNELS 
SPSYTYYRPG VNLSLSCHAA SNPPAQYSWL 
NSGLYTCQAN NSASGHSRTT VKTITVSAEL 
FTCEPEAQNT TYLWWVNGQS LPVSPRLQLS 
CGIQNSVSAN RSDPVTbDVI. YGPDTPIISP 
PSPQYSWRIN G I PQQHTQVti FIAKITPNNN 
SITVSASGTS PGIiSAGATVG IMIGVLVGVA 
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1 ATGGGCAGCC CCAGCGCCCC CCTGCACCGC TGGTGCATCC CCTGGCAGAC 

CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG 
101 CCCAGCTGAC CATCGAGAGC CGCCCCTTCA ACGTGGCCGA GGGCAAGGAG 
GTGCTGCTGC TGGCCCACAA CGTGAGCCAG AACCTGTTCG GCTACATCTG 
201 GTACAAGGGC GAGCGCGTGG ACGCCAGGCG CCGCATCGGC AGCTGCGTGA 
TCCGCACCCA GCAGATCACC CCCGGCCCCG CCCACAGCGG CCGCGAGACC 
301 ATCGACTTCA ACGCCAGCCT GCTGATCCAC AACGTGACCC AGAGCGACAC 
CGGCAGCTAC ACCATCCAGG TGATCAAGGA GGACCTGGTG AACGAGGAGG 
401 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CTACATCAGC 
AGCAACAACA GCAACCCCGT GGAGGACAAG GACGCCGTGG CCCTGACCTG 
501 CGAGCCCGAG ACCCAGGACA CCACCTACCT GTGGTGGGTG AACAACCAGA 
GCCTGCCCGT GAGCCCCCGC CTGGAGCTGA GCAGCGACAA CCGCACCCTG 
601 ACCGTGTTCA ACATCCCCCG CAACGACACC ACCAGCTACA AGTGCGAGAC 
CCAGAACCCC GTGAGCGTGC GCCGCAGCGA CCCCGTGACC CTGAACGTGC 
7 01 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CCCCTACCGC 
GCCGGCGAGA ACCTGAACCT GACCTGCCAC GCCGCCAGCA ACCCCACCGC 
801 CCAGTACTTC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC 
TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA CATGTGCCAG 
9 01 GCCCACAACA GCGCCACCGG CCTGAACCGC ACCACCGTGA CCGCCATCAC 
CGTGTACGCC GAGCTGCCCA AGCCCTACAT CACCAGCAAC AACAGCAACC 
1001 CCATCGAGGA CAAGGACGCC GTGACCCTGA CCTGCGAGCC CGAGACCCAG 
GACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGA GCGTGAGCAG 
1101 CCGCCTGGAG CTGAGCAACG ACAACCGCAC CCTGACCGTG TTCAACATCC 
CCCGCAACGA CACCACCTTC TACGAGTGCG AGACCCAGAA CCCCGTGAGC 
1201 GTGCGCCGCA GCGACCCCGT GACCCTGAAC GTGCTGTACG GCCCCGACGC 
CCCCACCATC AGCCCCCTGA ACACCCCCTA CCGCGCCGGC GAGAACCTGA 
13 01 ACCTGAGCTG CCACGCCGCC AGCAACCCCG CCGCCCAGTA CAGCTGGTTC 
GTGAACGGCA CCTTCCAGCA GAGCACCCAG GAGCTGTTCA TCCCCAACAT 
1401 C AC CGTGAAC AACAGCGGCA GCTACATGTG CCAGGCCCAC AACAGCGCCA 
CCGGCCTGAA CCGCACCACC GTGACCGCCA TCACCGTGTA CGTGGAGCTG 
1501 CCCAAGCCCT ACATCAGCAG CAACAACAGC AACCCCATCG AGGACAAGGA 
CGCCGTGACC CTGACCTGCG AGCCCGTGGC CGAGAACACC ACCTACCTGT 
1601 GGTGGGTGAA CAACCAGAGC CTGAGCGTGA GCCCCCGCCT GCAGCTGAGC 
AACGGCAACC GCATCCTGAC CCTGCTGAGC GTGACCCGCA ACGACACCGG 
1701 CCCCTACGAG TGCGGCATCC AGAACAGCGA GAGCGCCAAG CGCAGCGACC 
CCGTGACCCT G AACGTG AC C TACGGCCCCG ACACCCCCAT CATCAGCCCC 
18 01 CCCGACCTGA GCTACCGCAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG 
CGACAGCAAC CCCAGCCCCC AGTACAGCTG GCTGATCAAC GGCACCCTGC 
1901 GCCAGCACAC CCAGGTGCTG TTCATCAGCA AGATCACCAG CAACAACAGC 
GGCGCCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG 
2001 CATCGTGAAG AACATCAGCG TGAGCAGCGG CGACAGCTCT AGAAGCACCC 
CCATCCCATT CAGCTACAGC AAGAACCTGG ACTGCTGGGT GGACAACGAG 
2101 GAGGACATCG ACGTGATCCT GAAGAAGAGC ACCATCCTGA ACCTGGACAT 
CAACAAC G AC ATCATCAGCG ACATCAGCGG CTTCAACAGC AGC GTGATCA 
2201 CCTACCCCGA CGCCCAGCTG GTGCCCGGCA TCAACGGCAA GGCCATCCAC 
CTGGTGAACA ACGAGAGCAG CGAGGTGATC GTGCACAAGG CCATGGACAT 
2301 CGAGTACAAC GACATGTTCA ACAACTTCAC CGTGAGCTTC TGGCTGAGAG 
TGCCTAAGGT GAGCGCCAGC CACCTGGAGC AGTACGGCAC CAACGAGTAC 
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2401 AGCATCATCA GCAGCATGAA GAAGCACAGC CTGAGCATCG GCAGCGGCTG 

GAGCGTGAGC CTGAAGGGCA ACAACCTCAT CTGGACCCTG AAGGATAGCG 
2501 CCGGAGAGGT GAGACAGATC ACCTTCAGAG ACCTGCCCGA CAAGTTCAAT 

GCCTACCTGG CCAACAAGTG GGTGTTCATC ACCATCACCA ACGACAG AC T 
2 601 GAGCAGCGCC AACCTGTACA TCAACGGCGT GCTCATGGGC AGCGCCGAGA 

TCACCGGCCT GGGCGCCATC AGAGAGGACA ACAACATCAC CCTGAAGCTG 
2701 GACAGATGCA ACAACAACAA CCAGTACGTG AGCATCGACA AGTTCCGGAT 

CTTCTGCAAG GCCCTGAACC C CAAGGAG AT CGAGAAGCTG TACACCAGCT 
2 801 ACCTGAGCAT CACCTTCCTG AGAGACTTCT GGGGCAACCC CCTGAGATAC 

GACACCTAG (SEQ ID NO: 35) 

FIGURE 15A 



! 



Copv orovided bv USPTO from the IFW Imaqe Database on 05/17/2005 



22/24 

1 MGSPSAPLHR WCIPWQTLLL, TASLLTFWNP PTTAQLTIES RPFNVAEGKE 

51 VLLLAHNVS Q NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET 

101 IDFNASLLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS 

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTL 

201 TVFNI PRNDT TSYKCETQNP VSVRRSDPVT LNVLYGPDAP TISPIiNTPYR 

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ 

301 AHNSATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ 

351 DTTYLWWVNN QSLSVSSRLE LSNDNRTLTV FNIPRNDTTF YECETQNPVS 

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQYSWF 

451 VNGTFQQSTQ ELFIPNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEL 

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS 

551 NGNRILTLLS VTRNDTGPYE CGIQNSESAK RSDPVTLNVT YGPDTPIISP 

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWL.IN GTLRQHTQVL FISKITSNNS 

651 GAYACFVSNL ATGRNNSIVK NISVSSGDSS RSTPIPFSYS KNLDCWVDNE 

7 01 EDIDVILKKS TILNLDINND IISDISGFNS SVITYPDAQL* VPGINGKAIH 

751 LVNNESSEVI VHKAMDIEYN DMFNNF TVS F WLRVPKVSAS HLEQYGTNEY 

801 SIISSMKKHS LSIGSGWSVS LKGNNLIWTL KDSAGEVRQI TFRDLPDKFN 

851 AYLANKWVFI TITNDRLSSA NLYINGVLMG SAEITGLGAI REDNNITLKL 

901 DRCNNWNQYV SIDKFRIFCK ALNPKEIEKL YTSYLSITFL RDFWGKTPLRY 

951 DT* (SEQ ID NO: 36) 

FIGURE 15B 
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1 ATGGGCAGCC CCAGCGCCCC CCTGCACCGC TGGTGCATCC CCTGGCAGAC 

CCTGCTGCTG ACCGCCAGCC TGCTGACCTT CTGGAACCCC CCCACCACCG 
101 CCCAGCTGAC CATCGAGAGC CGCCCCTTCA ACGTGGCCGA GGGCAAGGAG 

GTGCTGCTGC TGGCCCACAA CGTGAGCCAG AACCTGTTCG GCTACATCTG 
2 01 GTACAAGGGC GAGCGCGTGG ACGCCAGCCG CCGCATCGGC AGCTGCGTGA 

TCCGCACCCA GCAGATCACC CCCGGCCCCG CCCACAGCGG CCGCGAGACC 
301 ATCGACTTCA ACGCCAGCCT GCTGATCCAC AACGTGACCC AGAGCGACAC 

CGGCAGCTAC ACCATCCAGG TGATCAAGGA GGACCTGGTG . AACGAGGAGG 
4 01 CCACCGGCCA GTTCCGCGTG TACCCCGAGC TGCCCAAGCC CTACATCAGC 

AGCAACAACA GCAACCCCGT GGAGGACAAG GACGCCGTGG CCCTGACCTG 
501 CGAGCCCGAG ACCCAGGACA CCACCTACCT GTGGTGGGTG AACAACCAGA 

GCCTGCCCGT GAGCCCCCGC CTGGAGCTGA GCAGCGACAA CCGCACCCTG 
601 ACCGTGTTCA ACATCCCCCG CAACGACACC ACCAGCTACA AGTGCGAGAC 

CCAGAACCCC GTGAGCGTGC GCCGCAGCGA CCCCGTGACC CTGAACGTGC 
701 TGTACGGCCC CGACGCCCCC ACCATCAGCC CCCTGAACAC CCCCTACCGC 

GCCGGCGAGA ACCTGAACCT GACCTGCCAC GCCGCCAGCA ACCCCACCGC 
801 CCAGTACTTC TGGTTCGTGA ACGGCACCTT CCAGCAGAGC ACCCAGGAGC 

TGTTCATCCC CAACATCACC GTGAACAACA GCGGCAGCTA C ATGTGC C AG 
901 GCCCACAACA GCGCCACCGG CCTGAACCGC ACCACCGTGA CCGCCATCAC 

CGTGTACGCC GAGCTGCCCA AGCCCTACAT CACCAGCAAC AACAGCAACC 
1001 CCATCGAGGA CAAGGACGCC GTGACCCTGA CCTGCGAGCC CGAGACCCAG 

GACACCACCT ACCTGTGGTG GGTGAACAAC CAGAGCCTGA GCGTGAGCAG 
1101 CCGCCTGGAG CTGAGCAACG ACAACCGCAC CCTGACCGTG TTCAACATCC 

CCCGCAACGA CACCACCTTC TACGAGTGCG AGACCCAGAA CCCCGTGAGC 
1201 GTGCGC CGC A GCGACCCCGT GACCCTGAAC GTGCTGTACG GCCCCGACGC 

CCCCACCATC AGCCCCCTGA ACACCCCCTA CCGCGCCGGC GAGAACCTGA 
1301 ACCTGAGCTG CCACGCCGCC AGCAACCCCG CCGCCCAGTA CAGCTGGTTC 

GTGAACGGCA CCTTCCAGCA GAGCACCCAG GAGCTGTTCA TCCCCAACAT 
1401 CACCGTGAAC AACAGCGGCA GCTACATGTG CCAGGCCCAC AACAGCGCCA 

CCGGCCTGAA CCGCACCACC GTGACCGCCA TCACCGTGTA CGTGGAGCTG 
1501 CCCAAGCCCT ACATCAGCAG CAACAACAGC AACCCCATCG AGGACAAGGA 

CGCCGTGACC CTGACCTGCG AGCCCGTGGC CGAGAAC AC C ACCTACCTGT 
1601 GGTGGGTGAA CAACCAGAGC CTGAGCGTGA GCCCCCGCCT GCAGCTGAGC 

AACGGCAACC GCATCCTGAC CCTGCTGAGC GTGACCCGCA ACGACACCGG 
1701 CCCCTACGAG TGCGGCATCC AGAACAGCGA GAGCGCCAAG CGCAGCGACC 

CCGTGACCCT GAACGTG AC C TACGGCCCCG ACACCCCCAT CATCAGCCCC 
1801 CCCGACCTGA GCTACCGCAG CGGCGCCAAC CTGAACCTGA GCTGCCACAG 

CGACAGCAAC CCCAGCCCCC AGTACAGCTG GCTGATCAAC GGCACCCTGC 
1901 GCCAGCACAC CCAGGTGCTG TTCATCAGCA AGATCACCAG CAACAACAGC 

GGCGCCTACG CCTGCTTCGT GAGCAACCTG GCCACCGGCC GCAACAACAG 
2001 CATCGTGAAG AACATCAGCG TGAGCAGCGG CGACAGCTCT AGAACCCCTC 

AGAACATCAC CGATCTGTGC GCCGAGTACC ACAACACCCA G ATC T AC AC C 
2101 CTGAACGACA AGATC TTCAG CTACACCGAG AGCCTGGCCG GCAAGAGAGA 

GATGGCCATC ATCACCTTCA AGAACGGCGC CATCTTCCAG GTGGAGGTGC 
22 01 CCGGCAGCCA GCACATCGAC AGCCAGAAGA AGGCCATCGA GCGGATGAAG 

GACACCCTGC GGATCGCCTA CCTCACCGAQ GCCAAGGTGG AGAAGCTGTG 
2301 CGTGTGGAAC AACAAGACCC CTCACGCCAT CGCCGCCATC AGCATGGCCA 

ATTGATAAG <SEQ ID NO: 37) 



FIGURE 16A 
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1 MGSPSAPLHR WCIPWQTLLL TASLLTFWNP PTTAQLTIES RPFNVAEGKE 

51 VLLLAHNVSQ NLFGYIWYKG ERVDASRRIG SCVIRTQQIT PGPAHSGRET 

101 IDFNASDLIH NVTQSDTGSY TIQVIKEDLV NEEATGQFRV YPELPKPYIS 

151 SNNSNPVEDK DAVALTCEPE TQDTTYLWWV NNQSLPVSPR LELSSDNRTI* 

201 TVFNIPRNDT TSYKCETQNP VSVRRSDPVT LNVTjYG PDAP TISPLNTPYR 

251 AGENLNLTCH AASNPTAQYF WFVNGTFQQS TQELFIPNIT VNNSGSYMCQ 

3 01 AHNSATGLNR TTVTAITVYA ELPKPYITSN NSNPIEDKDA VTLTCEPETQ 

351 DTTYLWWVNN QSLSVSSRLE LSNDNRTLTV FNIPRNDTTF YECETQNPVS 

401 VRRSDPVTLN VLYGPDAPTI SPLNTPYRAG ENLNLSCHAA SNPAAQY SWF 

451 VNGTFQQSTQ ELFIPNITVN NSGSYMCQAH NSATGLNRTT VTAITVYVEL 

501 PKPYISSNNS NPIEDKDAVT LTCEPVAENT TYLWWVNNQS LSVSPRLQLS 

551 NGNRI LTLLS VTRNDTGPYE CGIQNSESAK RSDPVTLiNVT YGPDTPIISP 

601 PDLSYRSGAN LNLSCHSDSN PSPQYSWLIN GTLRQHTQVL FISKITSNNS 

651 GAYACFVSNL ATGRNNS I VK NISVSSGDSS RTPQNITDLC AEYHNTQIYT 

7 01 LNDKIFSYTE SLAGKREMAI ITFKNGAIFQ VEVPGSQHID SQKKAIERMK 

751 DTLRIAYI/TE AKVEKLCVWN NKTPHAIAAI SMAN* * (SEQ ID NO: 38) 

FIGURE 16B 
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